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Secreted Proteins 



FIELD OF THE INVENTION 

The invention relates to secreted proteins and their precursors, isolated polynucleotides 
encoding such polypeptides, polymorphic variants thereof, and the use of said nucleic acids and 
proteins or compositions thereof in detection assays, for disease diagnosis, for disease treatment 
and for drug development 

BACKGROUND 

Proteins from the phosphatidylethanolarnine-binding protein (PEBP) family arc highly 
conserved throughout nature and have no significant sequence homology with other proteins of 
known structure or function. These proteins are found in a wide variety of neural, peripheral, 
reproductive and other tissues (Banfield MJ, Barker JJ, Perry AC, Brady RL, Function from 
structure? The crystal structure of human phosphaUdylcthanolaminc-bmding protein suggests a 
role in membrane signal transduction, Structure ,(1998), Oct 15;6(10): 1245-54). They are reported 
as cytoplasmic proteins, and have been observed as localised to the inner pcriplasmic membrane 
(Bantlcld et aL, supra) 

From a structural point of view, the PEBPs comprise a central beta sheet with highly 
conserved residues, which contains a binding pocket where a variety of ligands, such as 
phospholipids, opioids, hydrophobic odorant molecules and possibly G protein coupled signalling 
proteins can bind. Despite the widespread expression and the resolution of 3D structures of bovine 
and human PEBP by X-ray crystallography (Scrre L., Vallee B., Burcaud N. f Schoentgcn F. and 
Zclwcr C. Crystal structure of the phosphatidylcthanolamine-binding protein from bovine brain: a 
novel structural class of phospholipid-binding proteins, Structure 1998 Oct 15,6(10): 1255-65 and 
Banfield et al. supra), very little is known about the function or properties of the proteins in this 
family. A possible role for PEBP could be the transduction of membrane signal by regulating 
intracellular signaling cascades (MAP kinase pathway) (Simister PC, Banfield MJ, Brady RL, The 
crystal structure of PEBP-2, a homologuc of the PEBP/RKJP family. Acta Crysia/iogr D Biol 
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Crystatlogr .(2002) Jun;S8(Pt 6 Pt 2):1077-80). A method of inhibiting a protease by contacting it 
with an effective amount of a PEBP family member has been described (PCT patent publication 
WO 02/18623). 

The present invention discloses a new sub-family of proteins which: 

• are secreted 

• have a high level of homology with known PEBP proteins over the ligand binding 
site region 

• have a very different, but conserved, N-ter sequence, which contains the signal 
peptide for secretion. 
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SUMMARY OP THE INVENTION 

The present invention is directed to compositions related secreted proteins, designated 
herein "Pro4 12-related" proteins. Such compositions include Pro412-related proteins, having an 
amino acid sequence of SEQ ID NO: I to 4. Pro412-related proteins precursors, Pro4 12-related 
antibodies, including monoclonal antibodies and other binding compositions derived therefrom, 
and methods of making and using these compositions. 

In another aspect, the invention includes polynucleotides coding for a polypeptide having 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1 through 4, antisensc 
oligonucleotides complementary to such sequences, oligonucleotides complementary to Pro4I2- 
related gene sequences useful in diagnostic and analytical assays, such as primers for polymerase 
chain reactions (PCRs), and vectors for expressing Pro4 12-related proteins. 

The invention further includes methods of using Pro4"l2-relatcd proteins compositions, 
including antisense and antibody compounds, to treat disorders associated with aberrant expression 
of the Pro412-reIaled proteins of SEQ ID NO: 1 to 4 in an individual, and methods of using 
Pro4 12-related proteins compositions, including primers complementary to the Pro412-rclated 
gene and/or messenger RNA and anti-Pro4 12-related antibodies, for detecting and measuring 
quantities of the Pro4 12-related proteins in tissues and biological fluids. The invention also 
further includes methods of screening for compounds that inhibit or increase the expression of 
Pro412-related proteins, as well as methods of screening for compounds that interact with and/or 
inhibit or increase the activity of Pro4I2-rclatcd proteins. The invention further encompasses 
compounds thus identified, as well as compositions thereof. 

In another aspect, the invention includes polypeptides having a sequence which is 
at least 95 percent identical to SEQ ID NO: 1 to 4. Preferably, the invention includes polypeptides 
having at least 97 percent, and more preferably at least 98 percent, and still more preferably at least 
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99 percent, identity with any one of die sequences selected from SEQ ID NO: 1 to 4. Most 
preferably, the invention includes polypeptides having a sequence at least 99 percent identical to 
SEQ ID NO: 1 to 4. 

5 In another aspect, the invention includes an isolated peptide fragment of Pro4 1 2-rela ted 

proteins having a sequence identical to a subsequence of SEQ ID NO: 1 to 4. 

In another aspect, the invention includes an isolated peptide consisting of 6 to 40 amino 
acids whose sequence is identical to a subsequence of consecutive amino acids in the Pro412- 
related protein having the sequence of SEQ ID NO: 1 to 4. Such peptides are useful intermediates 
10 in the production of antigenic compositions used in the production of peptide antibodies specific 
for Pro4l2-rclated proteins. 

In another aspect, the invention includes isolated antibodies specific for any of the 
polypeptides, peptide fragments, or peptides described above. Preferably, the antibodies of the 
invention arc monoclonal antibodies. Such antibodies have diagnostic and therapeutic 
15 applications, particularly in treating Pro4 12-related disorders. Treatment methods include, but arc 
not limited to, those that employ antibodies or antibody-derived compositions specific for a 
Pro4 12-related antigen. Diagnostic methods for detecting Pro41 2-related proteins in specific tissue 
samples, and for detecting levels of expression of Pro4 12-related proteins in tissues, also form part 
of the invention. Compositions comprising one or more antibodies described above, together with 
20 a pharmaceutical^ acceptable carrier are also within the scope of the invention. 

In another aspect, the invention includes an isolated polynucleotide having a sequence at 
least 95 percent indentical to any sequence of 100 to 1000 consecutive nucleotides of a sequence 
that encodes a Pro4 12-related protein having the amino acid sequence of SEQ ID NO: 1 through 4, 

25 or any complementary sequence of the foregoing. More preferably, the invention includes an 

isolated polynucleotide having at least 98 percent, and most preferably at least 99 percent, identity 
with any sequence of 100 to 1000 consecutive nucleotides of a sequence that encodes a Pro4l2- 
rclatcd protein having the amino acid sequence of SEQ ID NO: 1 through 4, or any complementary 
sequence of the foregoing. 

30 In another aspect, the invention includes an isolated polynucleotide that forms a detectable 

duplex under stringent hybridization condiuons with a polynucleotide having any sequence of 100 
to 1000 consecutive nucleotides of a sequence that encodes a Pro4 12-related protein having the 
amino acid sequence of SEQ ID NO: 1 through 4, or any complementary sequence of the 
foregoing 

3S In another aspect, the invention includes an isolated polynucleotide that encodes Pro412- 

relaied proteins having the amino acid sequence of SEQ ID NO: 1 to 4. 

In another aspect, the invention includes primer pairs for carrying out a PCR to amplify a 
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segment of a polynucleotide of the invention. Each primer of a pair is an oligonucleotide Having a 
length of between 15 and 30 nucleotides such that i) one primer of the pair forms a perfectly 
matched duplex with one strand of a polynucleotide of the invention and the other primer of the 
pair forms a perfectly matched duplex with the complementary strand of the same polynucleotide, 
5 and ii) the primers of a pair form such perfectly matched duplexes at sites on the polynucleotide 
that are separated by a distance of between 10 and 2500 nucleotides. Preferably, the annealing 
temperature of each primer of a pair with its respective complementary sequence is substantially 
the same. 

In another aspect, the invention includes natural variants of the Pro41 2-related protein 

10 having a frequency in a selected population of at least two percent More preferably, such natural 
variant has a frequency in a selected population of at least five percent, and still more preferably, 
at least ten percent Most preferably, such natural variant has a frequency in a selected population 
of at least twenty percent The selected population may be any recognized population of study in 
the field of population genetics. Preferably, the selected population is Caucasian, Negroid, or 

15 Asian. More preferably, the selected population is French, German, English, Spanish, Swiss, 

Japanese, Chinese, Irish, Korean, Singaporean, Icelandic, North American, Israeli, Arab, Turkish, 
Greek, Italian, Polish, Pacific Islander, Finnish, Norwegian, Swedish, Estonian, Austrian,, or 
Indian. More preferably, the selected population is Icelandic, Saami, Finnish, French of Caucasian 
ancestry, Swiss, Singaporean of Chinese ancestry, Korean, Japanese, Qucbccian, North American 

20 Pima Indians, Pennsylvania Amish and Amish Mcnnonitc, Newfoundlander, or Polynesian. 

In another aspect, the invention provides a vector comprising DNA encoding a Pro4l2- 
rclated protein. The invention also includes host ceils and transgenic non human animals 
comprising such a vector. There is also provided a method of making a Pro4 1 2-related proteins or 
Pro4l2-rclated proteins precursor. The method comprises the steps of (a) providing a host cell 

25 containing an expression vector as disclosed above; (b) culturing the host cell under conditions 
whereby the DNA segment is expressed; and (c) recovering the protein encoded by the DNA 
segment Within one embodiment the expression vector further comprises a secretory signal 
sequence opcrably linked to the DNA segment, the cell secretes the protein into a culture medium, 
and the protein is recovered from the medium. 

30 

In still a further aspect, the invention includes pharmaceutical compositions and 
formulations comprising a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 to 4, and a pharmaceutical ly acceptable carrier compound. 

35 Further aspects of the invention are also decribed in the specification and in the claims. 
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BRIEF DESCRIPTION OF THE SEQUENCE L1STINH 



SEQID 


content 


i 
1 


entire protein Q96S96 Cerase' form) 


2 


mature peptide 




entire protein Q8WW74 Tcrasc 1 fornri 




A 
** 


mature peptide 


c 

J 


genomic seq. 1 (NT 008300.6 fragment) 1 


6 


genomic seq.2(NT 008300.6 fragment) 


7 


genomic seq3(NT 008300.6 fraRment) 


8 


genomic seq.4(NT 008300.6 fragment) 




genomic seq. for seqID 1 and seqLD 2 
(AC105046.2 fragment) 


10 


exon 1 


11 


exon 2 


12 


exon 3 


13 


exon 4 


14 


exon 5 




15 


exon 6 for seqID 3 and seqID 4 


16 


exon 6 for seqID 1 and seqID 2 


17 


cDNA for seqID 1 and seqID 2 
(EST BM984582) 


18 


cDNA for seqID 3 and seqID 4 
(EST A1857902) 



BRIEF DES CRIPTrOTSf OF THE PICIIRKS 

Figure 1 shows the sequence of one Pro412-rclalcd protein of the invention, from SEQ ID 
NO 1. The tryptic peptides observed by tandem mass spectrometry are bold and underlined. The 
tryptic peptides observed by MALDI mass spectrometry are in italics and underlined. The signal 
peptide has been underlined. 

Figure 2 illustrates the structural homology between the C-term part of the Pro412-related 
proteins of the invention and the proteins from the PEBP family: pdblbeh designates the human 
PEBP protein and pdbla44 designates the bovine PEBP protein. The structure of both proteins has 
been solved by X-ray crystallography (Banfield et al. supra and Scire et a!., supra). Q96S96 
designates one Pro412-related protein of the invention, from SEQ ID NO 2. The secondary 
structural elements which are predicted to be conserved between the known PEBP proteins and the 
Pro412-relatcd protein of the invention have been identified above the corresponding stretches of 
sequence. The cis Glu from the known protein which is mutated to Phe herein has been 
highlighted. 

Figure 3 illustrates mainly the conserved N-term part of the Pro4l2-related proteins of the 
invention among species. Q96S96 and Q8WW74 correspond to two Prn4 12-rclated proteins of the 
invention, from SEQ ID NO 2 and 4, respectively, and Q9D9G2 and Q9D9L9 are two mouse 
homologs of the Pro4 12-rclated proteins of the invention. The 4 conserved Cys residues have been 
highlighted, as well as the residue Phe in human and Tyr in mouse which correspond, within the 
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PEBP-like active site, to the Glu83 residue which is found to be in a Cis conformation in human 
PEBP. 

DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

As used herein, the term "nucleic acids" and "nucleic acid molecule" is intended lo include 
DNA molecules (c.g^ cDNA or genomic DNA) and RNA molecules (eg., mRNA) and analogs of 
the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can be single- 
stranded or double-stranded, but preferably is double-stranded DNA. Throughout the present 
specification, the expression "nucleotide sequence" may be employed to designate indifferently a 
polynucleotide or a nucleic acid. More precisely, the expression "nucleotide sequence" 
encompasses the nucleic material itself and is thus not restricted to the sequence information (i.e. 
the succession of letters chosen among the four base letters) that biochemically characterizes a 
specific DNA or RNA molecule. Also, used interchangeably herein are terms "nucleic acids", 
15 "oligonucleotides", and "polynucleotides". 

An "isolated" nucleic acid molecule is one which is separated from other nucleic acid 
molecules which arc present in the natural source or the nucleic acid. Preferably, an "isolated" 
nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., sequences located at 
the 5' and 3' ends of the nucleic acid) in the genomic DNA or the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated Pro412-rclatcd nucleic acid 
molecule can contain less than about 5 kb, 4 kb, 3 kb. 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide 
sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell from which 
the nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a cDNA 
molecule, can be substantially free of other cellular material, or culture medium when produced by 
recombinant techniques, or substantially free of chemical precursors or other chemicals when 
chemically synthesized. A nucleic acid molecule of the present invention, or a portion thereof, can 
be isolated using standard molecular biology techniques and the sequence information provided 
herein. Using all or a portion of a nucleic acid of the present invention, as a hybridization probe, 
Pro412-rclatcd nucleic acid molecules can be isolated using standard hybridization and cloning 
techniques (e.g., as described in Sambrook, J., Fntsh, E. b\, and Maniatis, T. Molecular Cloning. A 
Laboratory Manual. 2nd, cd., Cold Spring Harbor Laboratory. Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y., 1989). 
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A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR 
amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector 
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and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to 
Pro4 1 2-related nucleotide sequences can be prepared by standard synthetic techniques, e.g., using 
an automated DNA synthesizer. 

5 As used herein, the term "hybridizes to" is intended to describe conditions for moderate 

stringency or high stringency hybridization, preferably where the hybridization and washing 
conditions permit nucleotide sequences at least 60% homologous to each other to remain 
hybridized to each other. Preferably, the conditions are such that sequences at least about 70%, 
more preferably at least about 80%, even more preferably at least about 85%, 90%, 95% or 98% 

10 homologous to each other typically remain hybridized to each other. Stringent conditions are 

known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John 
Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stnngcnt 
hybridization conditions are as follows: the hybridization step is realized at 65°C in the presence of 
6 x SSC buffer, 5 x Denhardt's solution, 0,5% SDS and lOOug/ml of salmon sperm DNA. The 

15 hybridization step is followed by four washing steps: 

- two washings during 5 min, preferably at 65°C in a 2 x SSC and 0.1 %SDS buffer; 

- one washing during 30 min, preferably at 65°C in a 2 x SSC and 0.1% SDS buffer, 

- one washing during 10 min, preferably at 65°C in a 0.1 x SSC and 0.1%SDS buffer, 
these hybridization conditions being suitable for a nucleic acid molecule of about 20 nucleotides in 

20 length. It will be appreciated that the hybridization conditions described above are to be adapted 
according to the length of the desired nucleic acid, following techniques welt known to the one 
skilled in the art, for example be adapted according to the teachings disclosed in Hames B.D. and 
Higgins S.J. (1985) Nucleic Acid Hybridization: A Practical Approach. Hames and Higgins Ed., 
IRL Press, Oxford; and Current Protocols in Molecular Biolog (supra 

25 

To determine the percent homology of two amino acid sequences or of two nucleic acids, 
the sequences arc aligned for optimal comparison purposes (e.g., gaps can be introduced in the 
sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second amino 
or nucleic acid sequence and non-homologous sequences can be disregarded for comparison 

30 purposes). In a preferred embodiment, the length of a reference sequence aligned for comparison 
purposes is at least 30%, preferably at least 40%, more preferably at least 50%, even more 
preferably at least 60%, and even more preferably at least 70%, 80%, 90% or 95% of the length of 
the reference sequence. The amino acid residues or nucleotides at corresponding amino acid 
positions or nucleotide positions are then compared. When a position in the first sequence is 

35 occupied by the same amino acid residue or nucleotide as the corresponding position in the second 
sequence, then the molecules arc homologous at that position (i.e., as used herein amino acid or 
nucleic acid "identity" is equivalent to amino acid or nucleic acid "homology"). The percent 
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homology between the two sequences is a function of the number of identical positions shared by 
the sequences (i e., % homology** of identical positions/total # of positions 100). 

The comparison of sequences and determination of percent homology between two 
sequences can be accomplished using a mathematical algorithm. A preferred, non-limiting example 
of a mathematical algorithim utilized for the comparison of sequences is the algorithm of Karlin 
and Altschul (1990) Proa Natl. Acad. Sci. USA 87:2264-68, modified as in Karlin and Altschul 
(1993) Proc. Natl. Acad. Sci. USA 90.5873-77, the disclosures of which ate incorporated herein by 
reference in their entireties. Such an algorithm is incorporated into the NBLAST and XBLAST 
programs (version 2.0) of Altschul, etal. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide 
searches can be performed with the NBLAST program, scorc=100, wordlength=l2 to obtain 
nucleotide sequences homologous to Pro412-related nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score=50, wordlength=3 to 
obtain amino acid sequences homologous to Pro412-related protein molecules of the invention. To 
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et ah, (1 997) Nucleic Acids Research 25(17)3389-3402. When utilizing BLAST and 
Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be used. See http://www.ncbi.nlm.nih.gov, the disclosures of which are 
incorporated herein by reference in their entireties. Another preferred, non-limiting example of a 
mathematical algorithim utilized for the comparison of sequences is the algorithm of Myers and 
Miller, CABIOS (1989), the disclosures of which are incorporated herein by reference in their 
entireties. Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part 
of the GCG sequence alignment software package. When utilizing the ALIGN program for 
comparing amino acid sequences, a PAMI20 weight residue table, a gap length penalty of 12, and 
a gap penalty of 4 can be used. 

The term "polypeptide" refers to a polymer of amino acids without regard to the length of 
the polymer; thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide This term also does not specify or exclude post-expression modifications of 
polypeptides, for example, polypeptides which include the covalent attachment of glycosyl groups, 
acetyl groups, phosphate groups, lipid groups and the like are expressly encompassed by the term 
polypeptide. Also included within the definition are polypeptides which contain one or more 
analogs of an amino acid (including, for example, non-naturally occurring amino acids, amino 
acids which only occur naturally in an unrelated biological system, modified amino acids from 
mammalian systems etc.), polypeptides with substituted linkages, as well as other modifications 
known in the an, both naturally occurring and non-naturally occurring. 

The term '•protein" as used herein may be used synonymously with the term "polypeptide" 
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or may refer to, in addition, a complex of two or more polypeptides which may be linked by bonds 
other than peptide bonds, for example, such polypeptides making up the protein may be linked by 
disulfide bonds. The term "protein" may also comprehend a family of polypeptides having 
identical amino acid sequences but different post-translational modifications, such as 
5 phosphorylations, acylations, glycosylations, and the like, particularly as may be added when such 
proteins are expressed in eukaryotic hosts. 

An "isolated" or "purified" protein or biologically active portion thereof is substantially 
free of cellular material or other contaminating proteins from the cell or tissue source from which 

10 the Pro41 2 -related protein, or a biologically active fragment or homol ogue thereof is derived, or 
substantially free from chemical precursors or other chemicals when chemically synthesized. The 
language "substantially free of cellular material" includes preparations of a protein according to 
the invention (e.g. Pro41 2-related protein, or a biologically active fragment or homologue thereof) 
in which the protein is separated from cellular components of the cells from which it is isolated or 

15 recombinant^ produced. In one embodiment, the language "substantially free of cellular material" 
includes preparations of a protein according to the invention having less than about 30% (by dry 
weight) of protein other than the Pro41 2-related protein (also referred to herein as a 
"contaminating protein"), more preferably less than about 20% of protein other than the protein 
according to the invention, still more preferably less than about 10% of protein other than the 

20 protein according to the invention, and most preferably less than about 5% of protein other than the 
protein according to the invention. When the protein according to the invention or biologically 
active portion thereof is recombinant^ produced, it is also preferably substantially free of culture 
medium, i.e., culture medium represents less than about 20%, more preferably less than about 
10%, and most preferably less than about 5% of the volume of the protein preparation. 

25 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of Pro4i 2-related protein, or a biologically active fragment or homologue thereof in 
which the protein is separated from chemical precursors or other chemicals which are involved in 
the synthesis of the protein. In one embodiment, the language "substantially free of chemical 

30 precursors or other chemicals" includes preparations of a Pro41 2-related protein having less than 
about 30% (by dry weight) of chemical precursors or non-Pro4 1 2-related proteins chemicals, more 
preferably less than about 20% chemical precursors or non-Pro41 2-related proteins chemicals, still 
more preferably less than about 10% chemical precursors or non-Pro4 1 2-related proteins 
chemicals, and most preferably less than about 5% chemical precursors or non-Pro4l2-relatcd 

35 proteins chemicals. 

The term "recombinant polypeptide" is used herein to refer to polypeptides that have been 
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artificially designed and which comprise at least two polypeptide sequences that are not found as 
contiguous polypeptide sequences in their initial natural environment, or to refer to polypeptides 
which have been expressed from a recombinant polynucleotide. 

5 Another aspect of the invention pertains to anti-Pro4 1 2-rel ated antibodies. The term 

"antibody" as used herein refers to immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules, i.e., molecules that contain an antigen binding site which 
specifically binds (immunoreacts with) an antigen, such as a Pro4 12-related protein, or a 
biologically active fragment or homologuc thereof. Examples of immunologically active portions 

10 of immunoglobulin molecules include F(ab) and FCaV)* fragments which can be generated by 
treating the antibody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind a Pro4 1 2-related protein, or a biologically active fragment or 
homologuc thereof The term "monoclonal antibody" or "monoclonal antibody composition", as 
used herein, refers to a population of antibody molecules that contain only one species of an 

15 antigen binding site capable of immunoreacting with a particular epitope of a Pro4 12-related 

protein. A monoclonal antibody composition thus typically displays a single binding affinity for a 
particular Pro4 12-related protein with which it immunoreacts. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another 
20 nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a 

circular double stranded DNA loop into which additional DNA segments can be legated. Another 
type of vector is a viral vector, wherein additional DNA segments can be ligatcd into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into which they are 
introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian 
25 vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a 
host cell upon introduction into the host cell, and thereby arc replicated along with the host 
genome. Moreover, certain vectors are capable of directing the expression of genes to which they 
are operatively linked. Such vectors are referred to herein as "expression vectors". In general, 
expression vectors of utility in recombinant DNA techniques arc often in the form of plasmids. In 
30 the present specification, "plasmid" and "vector" can be used interchangeably as the plasmid is the 
most commonly used form of vector. However, the invention is intended to include such other 
forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses), which serve equivalent functions. 



35 Pro4 12-related proteins of the invention 

The Pro4 12-related proteins of the invention arc listed in SEQ ID NOS 1 to 4, SEQ ID 
NOS 2 and 4 being the sequences of the mature peptides obtained from SEQ ID NOS 1 and 3, 
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respectively. The sequences from SEQ ID NOS 1 and 3 represent two highly similar allelic 
variants, differing only at their C-terminus by the result of one regular Single Nucleotide 
Polymorphism (SNP) which mutates an E residue to a G residue, and one insertion SNP which 
induces a frameshift 

As described in Example 2, the Pro412-related proteins of the invention display in their C- 
lerm part a sequence predicted to be folded into an active site like the canonical ones found in the 
PEBP proteins. This structure has been described to form a ligand binding pocket at one end of the 
central beta sheet (Banfield et aL % supra). Interestingly, within this ligand-binding site, the Glu83 
in the human PEBP protein (highlighted in Figure 2), which has been shown to adopt a relatively 
rare Cis peptide bond conformation, is mutated to Phc in the Pro4l2-reIated proteins of the 
invention, which is a residue not commonly found in a Cis conformation. In the mouse homolog of 
the Pro412-related proteins of the invention, this residue is mutated to Tyr (see Figure 3). 

Furthermore, as described in Example 3, theN-ter part of the Pro412-related proteins of 
the invention is found to correspond to a conserved motif, which comprises a very clear signal 
peptide for secretion of the protein and a conserved repetition of 4 Cys residues. 

The Pro412-rclated proteins of the invention therefore represent a novel class of proteins 
which show an active ligand-binding site closely related to the previously characterized PEBP 
binding site, but which diverge from the PEBP proteins in their N-tcrm, by, firstly, being secreted 
proteins and not cytoplasmic proteins, and secondly, display a new structural domain conserved 
amongst many proteins and characterised by a conserved repetition of 4 Cys residues. 

Pro4J2-related nucleic acids 

The human Pro4l2-related gene sequence, shown herein to comprise 6 exons in SEQ ID 
NOS 10 to 16, which encodes the Pro412-related proteins. The huma Pro412-related gene is 
localized at chromosome 19. One aspect of the invention pertains to purified or isolated nucleic 
acid molecules that encode Pro412-related proteins or biologically active portions thereof as 
further described herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used 
for example in therapeutic and diagnostic methods and in drug screening assays as further 
described herein. 

An object of the invention is a purified, isolated, or recombinant nucleic acid coding for a 
Pro412-related protein, complementary sequences thereto, and fragments thereof. The invention 
also pertains to a purified or isolated nucleic acid comprising a polynucleotide having at least 95% 
nucleotide identity with a polynucleotide coding for a Pro4 12-rcluted protein, advantageously 99 
% nucleotide identity, preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide 
identity with a polynucleotide coding for a Pro412-rclatcd protein, or a sequence complementary 
thereto or a biologically active fragment thereof. Another object of the invention relates to 
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purified, isolated or recombinant nucleic acids comprising a polynucleotide that hybridizes, under 
the stringent hybridization conditions defined herein, with a polynucleotide coding for a Pro412- 
related protein, or a sequence complementary thereto or a variant thereof or a biologically active 
fragment thereof. 

In another preferred aspect, the invention pertains to purified or isolated nucleic acid 
molecules that encode a portion or variant of a Pro412-relatcd protein, wherein the portion or 
variant displays a Pro412-related activity of the invention. Preferably said portion or variant is a 
portion or variant of a naturally occuring full-length Pro412-related protein. 

Another object of the invention is a purified, isolated, or recombinant nucleic acid encoding 
a Pro412-related protein comprising, consisting essentially of, or consisting of the amino acid 
sequence ofSEQ ID NOS 1 to 4, or fragments thereof, wherein the isolated nucleic acid molecule 
encodes one or more motifs selected from the group consisting of a Pro412-relatcd target binding 
Tegion, a signal-anchor sequence and a substrate specificity determining region. For example, the 
purified, isolated or recombinant nucleic acid may comprise a genomic DNA or fragment thereof 
which encodes the polypeptide of SEQ ID NOS I to 4 or a fragment thereof, wherein the isolated 
nucleic acid molecule encodes one or more motifs selected from the group consisting of a Pro4l2- 
related target binding region, a nuclear localisation signal and an interferon gamma homology 
motif. Any combination of said motifs may also be specified. Preferably said Pro412-related 
target binding region is a target binding region or a DNA binding region. 

The nucleotide sequence determined from the cloning of the Pro412-rclatcd gene allows 
for the generation of probes and primers designed for use in identifying and/or cloning other 
Pro412-relatcd proteins family members (e.g. sharing the novel functional domains), as well as 
Pro412-related proteins homologues from other species. 

A nucleic acid fragment encoding a "biologically active portion of a Pro412-rclatcd 
protein" can be prepared by isolating a portion of a nucleotide sequence coding for a Pro4I2- 
relatcd protein, which encodes a polypeptide having a Pro412-re1atcd proteins biological activity, 
expressing the encoded portion of the Pro412-related protein (e.g., by recombinant expression in 
vitro or in vivo) and assessing the activity of the encoded portion of the Pro412-rclated protein. 

The invention further encompasses nucleic acid molecules that differ from the Pco412- 
related nucleotide sequences of the invention due to degeneracy of the genetic code and encode the 
same Pro412-*eIated proteins and fragment of the invention. 

In addition to the Pro412-rclatcd nucleotide sequences described above, it witl be 
appreciated by those skilled in the art that DNA sequence polymorphisms that lead to changes in 
the amino acid sequences of the Pro412-related proteins may exist within a population (e.g. t the 
human population). Such genetic polymorphism may exist among individuals within a population 
due to natural allelic variation. Such natural allelic variations can typically result in 1-5% variance 
in the nucleotide sequence of the Pro412-related gene or nucleic acid sequence encoding Pro412- 
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related protein. 

Nucleic acid molecules corresponding to natural allelic variants and homologues of the 
Prc412-related nucleic acids of the invention can be isolated based on their homology to the 
Pro412-rclatcd nucleic acids disclosed herein using the cDNAs disclosed herein, or a portion 
thereof; as a hybridization probe according to standard hybridization techniques under stringent 
hybridization conditions. 

Probes based on the Pro4 12-related nucleotide sequences can be used to detect transcripts 
or genomic sequences encoding the same or homologous proteins. In preferred embodiments, the 
probe further comprises a label group attached thereto, e.g., the label group can be a radioisotope, a 
fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as a part of a 
diagnostic test kit for identifying cells or tissue which misexpress a Pro4 12-related protein, such as 
by measuring a level of a Pro412-relatcd proteins-encoding nucleic acid in a sample of cells from a 
subject e.g., detecting Pro4 12-related proteins mRNA levels or determining whether a genomic 
Pro412-related gene has been mutated or deleted. 

It will be appreciated that the invention comprises polypeptides having an amino acid 
sequence encoded by any of the polynucleotides of the invention. 

Uses of Pro4 12-related nucleic acids 

Polynucleotide sequences (or the complements thereof) which encode Pro4 12-related 
proteins have various applications, including uses as hybridization probes, in chromosome and 
gene mapping, and in the generation of antisense RNA and DNA. In addition, Pro4 12-related 
proteins-encoding nucleic acids are useful as targets for pharmaceutical intervention, e.g. for the 
development of DNA vaccines, and for the preparation of Pro4 12-related proteins by recombinant 
techniques, as described herein. The polynucleotides described herein, including sequence variants 
thereof, can be used in diagnostic assays. Accordingly, diagnostic methods based on detecting the 
presence of such polynucleotides in body fluids or tissue samples are a feature of the present 
invention. Examples of nucleic acid based diagnostic assays in accordance with the present 
invention include, but arc not limited to, hybridization assays, e.g., in situ hybridization, and PCR- 
based assays. Polynucleotides, including extended length polynucleotides, sequence variants and 
fragments thereof, as described herein, may be used to generate hybridization probes or PCR 
primers for use in such assays. Such probes and primers will be capable of detecting 
polynucleotide sequences, including genomic sequences that are similar, or complementary to, the 
Pro412-rclated proteins polynucleotides described herein 

The invention includes primer pairs for carrying out a PCR to amplify a segment of a 
polynucleotide of the invention. Each primer of a pair is an oligonucleotide having a length of 
between 15 and 30 nucleotides such that i) one pnmer of the pair forms a perfectly matched duplex 
with one strand of a polynucleotide of the invention and the other primer of the pair form a 
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perfectly match duplex with the complementary strand of the same polynucleotide, and ii) the 
primers of a pair form such perfectly matched duplexes at sites on the polynucleotide that 
separated by a distance of between 10 and 2500 nucleotides. Preferably, the annealing temperature 
of each primer of a pair to its respective complementary sequence is substantially the same. 

Hybridization probes derived from polynucleotides of the invention can be used, for 
example, in performing in situ hybridization on tissue samples, such as fixed or frozen tissue 
sections prepared on microscopic slides or suspended cells. Briefly, a labeled DNA or RNA probe 
is allowed to bind its DNA or RNA target sample in the tissue section on a prepared microscopic, 
under controlled conditions. Generally, dsDNA probes consisting of the DNA of interest cloned 
into a plasmid or bacteriophage DNA vector are used for this purpose, although ssDNA or ssRNA 
probes may also be used. Probes are generally oligonucleotides between about 15 and 40 
nucleotides in length. Alternatively, the probes can be polynucleotide probes generated by PCR 
random priming primer extension or in vitro transcription of RNA from plasmids (riboprobes). 
These latter probes are typically several hundred base pairs in length. The probes can be labeled by 
any of a number of methods, including fluorescent tags, enzymes or radioactive moieties, 
according to methods well known in the art. The particular detection method will correspond to the 
type of label utilized on the probe (e.g., autoradiography, X-ray detection, fluorescent or visual 
microscopic analysis, as appropriate). The reaction can be further amplified in situ using 
immunocytochemical techniques directed against the label of the detector molecule used, such 
antibodies directed to a fluorescein moiety present on a fluorcsccnlly labeled probe, or against 
avidin, or marker enzymes (peroxidase, alkaline phosphatase). Specific labeling and in situ 
detection methods can be found, for example, in Howard, G. C. Ed., Methods in Nonradioactive 
Detection. Appleton & Langc, Norwalk, Conn., (1993), herein incorporated by reference. One 
preferred assay for detecting nucleic acids encoding Pro412-related protein utilizes the subject 
polynucleotides, or fragments thereof, as primers in a PCR-based assay. According to the assay, 
nucleic acids present in a test tissue or cell sample are amplified by polymerase chain reaction 
(PCR) using two primers consisting of at least 15 nucleotides derived from a polynucleotide of the 
invention, including primers derived from variants and/or extensions of such sequences, as 
described herein. Amplification products are detected in the sample by a method that is appropriate 
to the particular label used to label the amplification products, according to methods as described 
in U.S. Pat No. 4.683,1 95 For use in PCR detection methods, such as PCR in situ hybridization, 
PCR primers arc selected to be at least 15 nucleotides in length, and preferably between about 1 5 
and 30 nucleotides in length, and are selected from the DNA molecule of interest, according to 
methods known in the art. It may be desirable to select, for primers, sequences that encompass the 
longer nucleotide sequences Preferably, the probes arc selected such that the two hybridization 
sites are separated by between about 10 to 1,000 nucleotides (occasionally up to about 10,000 
nucleotides). PCR in situ hybridization of tissue sections and/or cell samples provides a highly 
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sensitive detection method for rare cell types in fixed cell or tissue samples. The PGR in situ 
hybridization detection method is carried out in accordance with methods that are known in the art, 
e.g., Nuovo, G. J., PCR IN SITU HYBRIDIZATION: PROTOCOLS AND APPLICATIONS. Raven 
Press, N.Y., 1992; U.S. Pat No. 5,538,871, both of which arc incorporated herein by reference. 
5 Briefly, a cell sample (tissue on microscopic slide, pelleted cell suspension) is fixed using 

a common fixative preparation, such as buffered formalin, formaldehyde or the like. Proteinase or 
detergent treatment is favored following fixation, to increase cell permeability to reagents. The 
PCR reaction is carried out in situ by polymerase chain reaction (PCR) using two primers. As 
discussed above, the primers are designed to selectively amplify one or more of the nucleotide 

10 sequences described herein. The amplification reaction mixture contains, in addition to the target 
nucleotide sample and the primers, a thermostable DNA polymerase, such as a polymerase derived 
from Themus aquaiicus (Taq polymerase, U.S. Pat. 4,889,818), and a sufficient quantity of the four 
standard deoxyribonucleotides (dNTPs), one or more of which may be labeled to facilitate 
detection. The reaction mixture is subjected to several rounds of thcrmocycling to produce 

15 multiple copies (amplification products) of the target nucleotide sequence. Amplification products 
are then detected in the sample, for example by detecting radioactively labeled amplification 
products. Hybridization probes and PCR primers may also be selected from the genomic sequences 
corresponding to the full-length proteins identified in accordance with the present invention, 
including promoter, enhancer elements and introns of the gene encoding the naturally occurring 

20 polypeptide. Nucleotide sequences encoding a Pro4 12-rcIatcd protein can also be used to construct 
hybridization probes for mapping the gene which encodes that Pro412-related proteins and for the 
genetic analysis of individuals. The nucleotide sequences provided herein may be mapped to a 
chromosome and specific regions of a chromosome using known techniques, such as in situ 
hybridization, linkage analysis against known chromosomal markers, and hybridization screening 

25 with libraries. Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the Pro412-related proteins cDNA. Computer analysis of the 3* 
untranslated region is used to rapidly select primers that do not span more than one exon in the 
genomic DNA, which would complicate the amplification process. Individuals carrying variations 
of, or mutations in the gene encoding a Pro4l2-rclated proteins of the present invention may be 

30 detected at the DNA level by a variety of techniques. Nucleic acids used for diagnosis may be 
obtained from a patient's cells, including, for example, tissue biopsy and autopsy material. 
Genomic DNA may be used directly for detection or may be amplified cnzymatically by using 
PCR [Saiki, et al. Nature 324: 163-166 (1986)] prior to analysis. RNA or cDNA may also be used 
for the same purpose As an example, PCR primers complementary to the nucleic acid of the 

35 present invention can be used to identify and analyze mutations in the gene of the present 

invention. Deletions and insertions can be detected by a change in size of the amplified product in 
comparison to the normal genotype. Point mutations can be identified by hybridizing amplified 
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DNA to radiolabeled RNA of the invention or alternatively, radiolabeled antisense DNA sequences 
of the invention. Sequence changes at specific locations may also be revealed by nuclease 
protection assays, such RNase and SI protection or the chemical cleavage method [e.g. Cotton, ct 
at., Pvoc. NaiL Acad. Set. USA 85:4397-4401 (1985)], or by differences in melting temperatures. 

5 "Molecular beacons** [Kostrikis L. G. et al. Science 279:1228-1229 (1998)], hairpin-shaped, 

single-stranded synthetic oligonucleotides containing probe sequences which arc complementary to 
the nucleic acid of the present invention, may also be used to detect point mutations or other 
sequence changes as well as monitor expression levels of Pro4l2-rclatcd proteins. 

Antisense Compounds. Oligonucleotides of the invention, including per primers and 

10 antisense compounds, are synthesized by conventional means on a commercially available 

automated DMA synthesizer, e.g. an Applied Biosystems (Foster City, CA) model 380B, 392 or 
394 DNA/RNA synthesizer, or like instrument Preferably, phosphoramidite chemistry is 
employed, e.g. as disclosed in the following references: Beaucage and Iyer, Tetrahedron, 48: 
2223-231 1 (1992); Molko et al, U.S. patent 4,980,460; Kostcr ct al, U.S. patent 4,725,677; 

15 Caruthers et al, U.S. patents 4,4 15,732; 4,458,066; and 4,973,679; and the like. For therapeutic 
use, nuclease resistant backbones are preferred. Many types of modified oligonucleotides are 
available that confer nuclease resistance, e.g. phosphorothioate, phosphorodithioate, 
phosphoramidate, or the like, described in many references, e.g. phosphorothioates: Stec et al, 
U.S. patent 5,151,510; Hirschbein, U.S. patent 5,166,387; Bergot, U.S. patent 5,183,885; 

20 phosphoramidates: Frochlcr et al, International application PCT/US90/03 138; and for a review of 
additional applicable chemistries: Uhlmann and Peyman (cited above). The length of the 
antisense oligonucleotides, i.e. such contiguous subsequence, is sufficiently large to ensure that 
specific binding will take place only at the desired target polynucleotide and not at other fortuitous 
sites. The upper range of the length is determined by several factors, including the inconvenience 

25 and expense of synthesizing and purifying oligomers greater than about 30-40 nucleotides in 
length, the greater tolerance of longer oligonucleotides for mismatches than shorter 
oligonucleotides, and the like. Preferably, the antisense oligonucleotides of the invention have 
lengths in the range of about 15 to 40 nucleotides. More preferably, the oligonucleotide moieties 
have lengths in the range of about 18 to 25 nucleotides. 

30 

Pro412-relaied proteins 

The terms "Pro412-rclatcd protein" and "Pro412-rclated proteins" is used herein to 
embrace any and all of the proteins and polypeptides of the present invention Also forming part 
35 of the invention arc polypeptides encoded by the polynucleotides of the invention, as well as 

fusion polypeptides comprising such polypeptides* The invention embodies polypeptides encoded 
by the nucleic acid sequences of the Pro412-rclatcd gene, as well as the Pro412-related proteins 
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from humans, including isolated or purified Pro412-related proteins consisting of, consisting 
essentially of, or comprising the sequence of SEQ ID NOS 1 to 4. 

The present invention embodies isolated, purified, and recombinant polypeptides 
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, 
more preferably at least 1 2, 1 5, 20, 25, 30, 40, 50, or 75 amino acids of SEQ ID NOS 1 to 4. In 
preferred embodiments the contiguous stretch of amino acids comprises the site of a mutation or 
functional mutation, including a deletion, addition, swap or truncation of the amino acids in the 
Pro412-related protein sequence. The invention also concerns the polypeptide encoded by the 
Pro412-reIated nucleotide sequences of the invention, or a complementary sequence thereof or a 
fragment thereof. 

One aspect of the invention pertains to isolated Pro41 2-related proteins, and biologically 
active portions thereof, as well as polypeptide fragments suitable for use as immunogens to raise 
anti-Pro4l2-rclated antibodies. In one embodiment, native Pro412-related proteins can be isolated 
from cells or tissue sources by an appropriate purification scheme using standard protein 
purification techniques. In another embodiment, Pro4I2-rclatcd proteins are produced by 
recombinant DNA techniques Alternative to recombinant expression, a Pro41 2-related protein or 
polypeptide can be synthesized chemically using standard peptide synthesis techniques* 

Typically, biologically active portions comprise a domain or motif with at least one 
activity of a Pro4 1 2-related protein. The present invention also embodies isolated, purified, and 
recombinant portions or fragments of a Pro41 2-related protein comprising a contiguous span of at 
least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 
30, 40, 50 or 75 amino acids of SEQ ID NOS 1 to 4. Also encompassed are Pro41 2-related 
proteins which comprise between 10 and 20, between 20 and 50, between 30 and 60, between 50 
and 99 amino acids of SEQ ID NOS I to 4. In other preferred embodiments the contiguous stretch 
of amino acids comprises the site of a mutation or functional mutation, including a deletion, 
addition, swap or truncation of the amino acids in the Pro412-relatcd protein sequence* 

A biologically active Pro4l2-related protein may, for example, comprise at least 1, 2, 3, 5, 
10, 20 or 30 amino acid changes from the sequence of SEQ ID NOS 1 to 4, or may encode a 
biologically active Pro4 1 2-related protein comprising at least 1%, 2%, 3%, 5%, 8%, 10% or 15% 
changes in amino acids from the sequence of SEQ ID NOS 1 to 4. 

In a preferred embodiment, a Pro41 2-related protein comprises a Pro41 2-related target 
binding region and/or a signal-anchor sequence. The invention also concerns the polypeptide 
encoded by the Pro4l 2-related nucleotide sequences of the invention, or a complementary 
sequence thereof or a fragment thereof 

In other embodiments, the Prc4 1 2-related protein is substantially homologous to the 
sequences of SEQ ID NOS 1 to 4, and retains the functional activity of the Pro412-retated protein, 
yet differs in amino acid sequence due to natural allelic variation or mutagenesis, as described 
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further herein. Accordingly, in another embodiment, the Pro41 2-relatcd protein is a protein which 
comprises an amino acid sequence shares more than about 60% but less than 100% homology with 
the amino acid sequence of SEQ ID NOS 1 to 4 and retains the functional activity of the Pro4 12- 
related proteins of SEQ ID NOS I to 4, respectively. Preferably, the protein is at least about 30%, 
5 40%, 50%, 60%, 70%, 80%, 85%, 90%, 92%, 95%, 97%, 98%, 99% or 99.8% homologous to SEQ 
ID NOS 1 to 4, but is not identical to SEQ ID NOS 1 to 4. Preferably the Pro41 2-related proteins 
is less than identical (e.g. 100% identity) to a naturally occurring Pro41 2-related proteins. Percent 
homology can be determined as further detailed above. 

10 Assessing polypeptides activity 

It will be appreciated that the invention further provides methods of testing the activity of, 
or obtaining, functional fragments and variants of Pro4 1 2-relatcd proteins and Pro412-rclatcd 
nucleotide sequences involving providing a variant or modified Pro412-related proteins, or Pro412- 

15 related nucleic acid and assessing whether a polypeptide encoded thereby displays a Pro4l2- 
rclated activity of the invention. Encompassed is thus a method of assessing the function of a 
Pro4 1 2-related proteins polypeptide comprising : (a) providing a Pro4 12-rclatcd protein, or a 
biologically active fragment or homologue thereof; and (b) testing said Pro41 2-related protein, or a 
biologically active fragment or homologue thereof for a Pro41 2-related activity, preferably a 

20 Pro4 12-rclatcd activity. Any suitable format may be used, including cell free, cell-based and in 
vivo formats. For example, said assay may comprise expressing a Pro41 2-related nucleic acid in a 
host cell, and observing Pro4 12-rclatcd activity in said cell. In another example, a Pro41 2-related 
protein, or a biologically active fragment or homologue thereof is introduced to a cell, and a 
Pro41 2-related activity is observed. 

25 

A Pro4 12-rclatcd activity may be any activity as described herein, such as ( 1) mediating 
levels of a Pro4 12-rclatcd proteins substrate protein; (2) mediating levels of a Pro41 2-related 
proteins substrate cleavage product; (3) mediating cell proliferation or cell cycle; or (4) interaction 
with a Pro412-rclatcd target molecule or preferably Pro412-related domain target molecule, 
30 preferably interaction with a protein e.g- a substrate protein). Detecting Pro4 1 2-rclated activity 

may also comprise detecting any suitable therapeutic endpoint discussed herein in the section titled 
"Methods of Treatment". 

Methods for obtaining variant nucleic acids and polypeptides 

35 

In addition to naturally-occurring allelic variants of the Pro4 12-related proteins sequences 
thai may exist in the population, the skilled artisan will appreciate that changes can be introduced 
by mutation into the nucleotide sequences ofSRQ ID NOS 1 to 4, thereby leading to changes in the 
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amino acid sequence of the encoded Pro4 1 2-related proteins, with or without altering the 
functional ability of the Pro41 2-related proteins. 

Several types of variants are contemplated including 1) one in which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino acid residue and 

5 such substituted amino acid residue may or may not be one encoded by the genetic code, or 2) one 
in which one or more of the amino acid residues includes a substituent group, or 3) one in which 
the mutated Pro4 1 2-related protein is fused with another compound, such as a compound to 
increase the half-life of the polypeptide (for example, polyethylene glycol), or 4) one in which the 
additional amino acids are fused to the mutated Pro4 12-related protein, such as a leader or signal- 

10 anchor sequence or a sequence which is employed for purification of the mutated Pro4 12-rclated 
protein or a preprotein sequence. Such variants arc deemed to be within the scope of those skilled 
in the art 

For example, nucleotide substitutions leading to amino acid substitutions can be made in 
the sequences that do not substantially change the biological activity of the protein. An amino acid 

15 residue-can be altered from the wild-type sequence encoding a Pro4 1 2-related protein, or a 
biologically active fragment or homologue thereof without altering the biological activity. In 
general, amino acid residues that are conserved among the Pro41 2-related proteins of the present 
invention? arc predicted to be less amenable to alteration. 

In one aspect, the invention pertains to nucleic acid molecules encoding Pro412-relatcd 

20 proteins, or biologically active fragments or homologues thereof that contain changes in amino 
acid residues that are not essential for activity. Such Pro4 12-related proteins differ in amino acid 
sequence from SEQ ID NOS 1 to 4 yet retain biological activity. In one embodiment, the isolated 
nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein 
comprises an amino acid sequence at least about 60% homologous to an amino acid sequence 

25 selected from the group consisting of SEQ ID NOS I to 4. Preferably, the protein encoded by the 
nucleic acid molecule is at least about 65-70% homologous to an amino acid sequence selected 
from the group consisting of SEQ ID NOS 1 to 4, more preferably sharing at least about 75-80% 
identity with an amino acid sequence selected from the group consisting of SEQ ID NOS 1 to 4, 
even more preferably sharing at least about 85%, 90%, 92%, 95%, 97%, 98%, 99% or 99.8% 

30 identity with an amino acid sequence selected from the group consisting of SEQ ID NOS I to 4. 

In another aspect, the invention pertains to nucleic acid molecules encoding Pro4 12-related 
proteins that contain changes in amino acid residues that result in increased biological activity, or a 
modified biological activity. In another aspect, the invention pertains to nucleic acid molecules 
encoding Pro4l2-rclatcd proteins that contain changes in amino acid residues that arc essential for 

3S a Pro4 12-related activity. Such Pro4 1 2-reJated proteins differ in amino acid sequence from SEQ ID 
NOS 1 to 4 and display reduced acnviry, or essentially lack one or more Pro412-rclatcd proteins 
biological activities. 
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An isolated nucleic acid molecule encoding a Pro4 12-related protein, or a biologically 
active fragment or homologue thereof, homologous to a protein of any one of SEQ ID NOS 1 to 4 
can be created by introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of SEQ ID NOS 1 to 4 such that one or more amino acid substitutions, 
additions or deletions are introduced into the encoded protein. Mutations can be introduced into 
any of SEQ ID NOS 1 to 4, by standard techniques, such as site-directed mutagenesis and PCR- 
mediated mutagenesis. For example, conservative amino acid substitutions may be made at one or 
more predicted non-essential amino acid residues. A "conservative amino acid substitution 1 * is one 
in which the amino acid residue is replaced with an amino acid residue having a similar side chain. 
Families of amino acid residues having similar side chains have been defined in the arL These 
families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side 
chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, 
glutaminc, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, 
histidine). Thus, a predicted nonessential amino acid residue in a Pro412-rclatcd protein, or a 
biologically active fragment or homologue thereof may be replaced with another amino acid 
residue from the same side chain family. Alternatively, in another embodiment, mutations can be 
introduced randomly along all or part of a Pro4 12-related proteins coding sequence, such as by 
saturation mutagenesis, and die resultant mutants can be screened for Pro4 12-related proteins 
biological activity to identify mutants that retain activity. Following mutagenesis of one of SEQ ID 
NOS 1 to 4, the encoded protein can be expressed rccombinantly and the activity of the protein can 
be determined. 

In a preferred embodiment, a mutant Pro412-related domain polypeptide, or a biologically 
active fragment or homologue thereof encoded by a Pro412-relatcd nucleic acid of the invention 
can be assayed for a Pro4 12-related activity in any suitable assay, examples of which are provided 
herein. 

The invention also provides Pro4 12-related proteins chimeric or fusion proteins. As used 
herein, a Pro4 12-related proteins "chimeric protein" or "fusion protein" comprises a Pro4 12-related 
protein of the invention or fragment therof, opcrativcly linked, preferably fused in frame, to a non- 
Pro4l2-rclatcd protein. In a preferred embodiment, a Pro4 12-related fusion protein comprises at 
least one biologically active portion of a Pro412-rclated protein, hi another preferred embodiment, 
a Pro4 12-related fusion protein comprises at leasl two biologically active portions of a Pro412- 
rclatcd protein. For example, in one embodiment, the fusion protein is a GST-Pro412-rclatcd 
fusion protein in which a Pro412-re)atcd domain sequences are fused to the C-terminus of the GST 
sequences Such fusion proteins can facilitate the purification of recombinant Pro412-rclaled 
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proteins. In another embodiment, the fusion protein is a Pro412-related protein containing a 
heterologous signal sequence at its N-terminus, such as for example to allow for a desired cellular 
localization in a certain host cell. 

The Pro4l2-re1ated fusion proteins of the invention can be incorporated into 
5 pharmaceutical compositions and administered to a subject in vivo. Moreover, the Pro4 12~related 
fusion proteins of the invention can be used as immunogens to produce anti-Pro41 2 -related 
antibodies in a subject, to purify Pro41 2-relatcd proteins ligands and in screening assays to identify 
molecules which inhibit the interaction of Pro41 2-relatcd proteins polpeptide with a Pro412-related 
target molecule. 

LO Furthermore, isolated peptidyl portions of the subject Pro412-reIated proteins can also be 

obtained by screening peptides recombinant^ produced from the corresponding fragment of the 
nucleic acid encoding such peptides. In addition, fragments can be chemically synthesized using 
techniques known in the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. 
For example, a Pro4 12-rclatcd protein of the present invention may be arbitrarily divided into 

15 fragments of desired length with no overlap of the fragments, or preferably divided into 

overlapping fragments of a desired length. The fragments can be produced (recombinantly or by 
chemical synthesis) and tested to identify those peptidyl fragments which can function as either 
agonists or antagonists of a Pro4l2-related activity, such as by microinjection assays or in vitro 
protein binding assays. In an illustrative embodiment, peptidyl portions of a Pro412-related 

20 protein, such as a Pro41 2-related target binding region, can be tested for Pro4 12-rclatcd activity by 
expression as thioredoxin fusion proteins, each of which contains a discrete fragment of the 
Pro412-related protein (see, for example, U.S. Patents 5, 270,18 1 and 5,292,646; and PCT 
publication W094/02502, the disclosures of which are incorporated herein by reference). 

The present invention also pertains to variants of the Pro412-rclatcd proteins which 

25 function as either Pro41 2-relatcd proteins mimetics or as Pro41 2-related inhibitors. Variants of the 
Pro412-rclated proteins can be generated by mutagenesis, e.g., discrete point mutation or 
truncation of a Pro41 2-related protein. An agonist of a Pro4 1 2-related protein can retain 
substantially the same, or a subset, of the biological activities of the naturally occurring form of a 
Pro4 1 2 -related protein. An antagonist of a Pro4 1 2-relatcd protein can inhibit one or more of the 

30 activities of the naturally occurring form of the Pro412-related protein by, for example, 

competitively inhibiting the association of Prc412-rclated protein with a Pro4 1 2-relatcd target 
molecule. Thus, specific biological effects can be elicited by treatment with a variant of limited 
function In one embodiment, variants of a Pro41 2-related protein which function as either Pro412- 
relatcd proteins agonists (mimetics) or as Pro41 2-relatcd proteins antagonists can be identified by 

35 screening combinatorial libraries of mutants, e.g., truncation mutants, of a Pro41 2-related protein 
for Pro4 1 2-related protein agonist or antagonist activity. In one embodiment, a variegated library 
of Pro4 12-rclatcd proteins variants is generated by combinatorial mutagenesis at the nucleic acid 
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level and is encoded by a variegated gene library. A variegated library of Pro4l2-related proteins 
variants can be produced by, for example, cnzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a degenerate set of potential Pro412-related 
proteins sequences is expressible as individual polypeptides, or alternatively, as a set of larger 
fusion proteins (e.g., for phage display) containing the set of Pro412-related proteins sequences 
therein. There are a variety of methods which can be used to produce libraries of potential Pro4l 2- 
relatcd proteins variants from a degenerate oligonucleotide sequence. Chemical synthesis ofa 
degenerate gene sequence can be performed in an automatic DNA synthesizer, and the synthetic 
gene then ligated into an appropriate expression vector. Use ofa degenerate set of genes allows for 
the provision, in one mixture, of all of the sequences encoding the desired set of potential Pro412- 
rclatcd proteins sequences. 

In addition, libraries of fragments ofa Pro412-related protein coding sequence can be used 
to generate a variegated population of Pro412-related proteins fragments for screening and 
subsequent selection of variants ofa Pro412-related protein. In one embodiment, a library of 
coding sequence fragments can be generated by treating a double stranded PCR fragment of 
Pro412-related proteins coding sequence with a nuclease under conditions wherein nicking occurs 
only about once per molecule, denaturing the double stranded DNA, renaturing the DNA to form 
double stranded DNA which can include sense/antisense pairs from different nicked products, 
removing single stranded portions from reformed duplexes by treatment with S 1 nuclease, and 
ligating the resulting fragment library into an expression vector. By this method, an expression 
library can be derived which encodes N-tcrminal, C-terminal and internal fragments of various 
sizes of the Pro412-rclatcd protein. 

Modified Pro412-rclated proteins can be used for such purposes as enhancing therapeutic 
or prophylactic efficacy, or stability (e.g., cx vivo shelf life and resistance to proteolytic 
degradation in vivo). Such modified peptides, when designed to retain at least one activity of the 
naturally occurring form of the protein, arc considered functional equivalents of the Pro412-relatcd 
protein described in more detail herein. Such modified peptide can be produced, for instance, by 
amino acid substitution, deletion, or addition. 

Whether a change in the amino acid sequence ofa peptide results in a functional Pro4 1 2- 
related proteins homolog (e.g. functional in the sense that it acts to mimic or antagonize the wild- 
type form) can be readily determined by assessing the ability of the variant peptide to cleave one or 
more substrate proteins, or produce a response in cells, in a fashion similar to the wild-type 
Pro4 12-rclatcd protein or competitively inhibit such a response. Peptides in which more than one 
replacement has taken place can readily be tested in the same manner. 

This invention further contemplates a method of generating sets of combinatorial mutants 
of the presently disclosed Pro412-related proteins, as well as truncation and fragmentation 
mutants, and is especially useful for identifying potential variant sequences which are functional in 
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binding to or cleaving a Pro41 2-related proteins- target protein but differ from a wild-type form of 
the protein by, for example, efficacy, potency and/or intracellular half-life. One purpose for 
screening such combinatorial libraries is, for example, to isolate novel Pro4 1 2-related proteins 
homologs which function as either an agonist or an antagonist of the biological activities of the 
wild-type protein, or alternatively, possess novel activities all together. For example, mutagenesis 
can give rise to Pro4 1 2-related proteins homologs which have intracellular half-lives dramatically 
different than the corresponding wild-type protein. The altered protein can be rendered either more 
stable or less stable to activation of the protease proenzyme, proteolytic degradation, or cellular 
process which result in destruction of, or otherwise inactivation of, a Pro41 2-related protein. Such 
Pro41 2-related proteinshomologs, and the genes which encode them, can be utilized to alter the 
envelope of expression for a particular recombinant Pro4l2-rclated protcinsprotcin by modulating 
the half-life of the recombinant protein. For instance, a short half-life can give rise to more 
transient biological effects associated with a particular recombinant Pro4l2-related protein and, 
when part of an inducible expression system, can allow tighter control of recombinant protein 
levels within a cell. As above, such proteins, and particularly their recombinant nucleic acid 
constructs, can be used in gene therapy protocols. 

In an illustrative embodiment of this method, the amino acid sequences for a population of 
Pro412-related proteins homologs or other related proteins arc aligned, preferably to promote the 
highest homology possible. Such a population of variants can include, for example, Pro41 2-related 
proteins homologs from one or more species, or Prc412-rclatcd proteins homologs from the same 
species but which differ due to mutation. Amino acids which appear at each position of the aligned 
sequences are selected to create a degenerate set of combinatorial sequences. There arc many ways 
by which the library of potential Pro4l2-rclated proteins homologs can be generated from a 
degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
carried out in an automatic DNA synthesizer, and the synthetic genes then be ligated into an 
appropriate gene for expression. The purpose of a degenerate set of genes is to provide, in one 
mixture, all of the sequences encoding the desired set of potential Pro41 2-related proteins 
sequences. The synthesis of degenerate oligonucleotides is well known in the art (see for example. 
Narang, SA (1983) Tetrahedron 393; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland 
Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp. 273-289; Itakura et al. (1984) 
Annu. Rev. Biochcm. 53:323; Itakura el al. (1984) Science 198:1056; Bee ct al. (1983) Nucleic 
Acid Res. 1 1:477 Such techniques have been employed in the directed evolution of other proteins 
(see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89 2429- 
2433; Devlin et al (1990) Science 249: 404-406; Cwuia et al. (1990) PNAS 87: 6378-6382; as 
well as U.S Patents Nos: 5, 223,409, 5,198,346, and 5,096,815). The disclosures of the above 
references arc incorporated herein by reference in Ihcir entireties. 

Alternatively, other forms of mutagenesis can be utilized to generate a combinatorial 
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library, particularly where no other naturally occurring homologs have yet been sequenced. For 
example, Pro4 1 2-related proteins homologs (both agonist and antagonist forms) can be generated 
and isolated from a library by screening using, for example, alanine scanning mutagenesis and the 
like (Ruf et a). (1994) Biochemistry 33:1565-1572; Wang et a!. (1994) J Biol. Chcm. 269:3095- 

5 3099; Balint et al. (1 993) Gene 137:109-1 18; Grodberg et al. (1993) Eur. J Biochem. 218 597-601; 
Nagashimaet al. (1993) J Biol. Chcm. 268:2888-2892; Lowman ct al. (1991) Biochemistry 
30:10832-10838; and Cunningham et al. (1989) Science 244:1081-1085), by linker scanning 
mutagenesis (Gustin ct al (1993) Virology 193:653-660; Brown et al. (1992) Mol. Cell Biol. 
12:2644 2652; McKnight ct al. (1982) Science 232:316); by saturation mutagenesis (Meyers et al. 

10 (1986) Science 232:613); by PCR mutagenesis (Leung et al. (1989) Method Cell Mol Biol 1:1- 
19); or by random mutagenesis (Milter et al. (1992) A Short Course in Bacterial Genetics, CSHL 
Press, Cold Spring Harbor, NY; and Greener et al. (1994) Strategics in Mol Biol 7:32-34, the 
disclosures of which arc incorporated herein by reference in their entireties). 

A wide range of techniques are known in the art for screening gene products of 

15 combinatorial libraries made by point mutations, as well as for screening cDNA libraries for gene 
products having a certain property. Such techniques will be generally adaptable for rapid screening 
of the gene libraries generated by the combinatorial mutagenesis of Pro412-related proteins. The 
most widely used techniques for screening large gene libraries typically comprises cloning the 
gene library into replicable expression vectors, transforming appropriate cells with the resulting 

20 library of vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates relatively easy isolation of the vector encoding the gene whose product 
was detected. 

Each of the illustrative assays described below arc amenable to high throughput analysis as 
necessary to screen large numbers of degenerate Pro412-rclatcd proteins sequences created by 

25 combinatorial mutagenesis techniques. In one screening assay, the candidate gene products are 
displayed on the surface of a cell or viral particle, and the ability of particular cells or viral 
particles to bind a Pro412-related target molecule (for example a modified peptide substrate) via 
this gene product is detected m a "panning assay". For instance, the gene library can be cloned into 
the gene for a surface membrane protein of a bacterial cell, and the resulting fusion protein 

30 detected by panning (Ladner ct aK, WO 88/06630; Fuchs et al (1991) BioTcchnology 9: 1 370- 
1371, and Goward et al. (1992) TIBS 18:136 140). In a similar fashion, fluorescently labeled 
Pro4 1 2-related target can be used to score for potentially functional Pro4 1 2-related proteins 
homologs. Cells can be visually inspected and separated under a fluorescence microscope, or, 
where the morphology of the cell permits, separated by a fluorescence- activated cell sorter. 

35 In an alternate embodiment, the gene library is expressed as a fusion protein on the surface 

of a viral particle. For instance, in the filamentous phage system, foreign peptide sequences can be 
expressed on the surface of infectious phage, thereby conferring two significant benefits. First, 
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since these phage can be applied to affinity matrices at very high concentrations, a large number of 
phage can be screened at one time. Second, since each infectious phage displays the combinatorial 
gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, 
the phage can be amplified by another round of infection. The group of almost identical E. coli 

5 filamentous phages M13, fd, and £1 are most often used in phage display libraries, as either of the 
phage gill or gVUI coat proteins can be used to generate fusion proteins without disrupting the 
ultimate packaging of the viral particle (Ladner et al. PCT publication WO 90/02909; Garrard et 
a!., PCT publication WO 92/09690; Marks et al. (1992) J Biol. Chcnu 267:16007-16010; Griffiths 
et al. (1993) EMBO J 12:725-734; Clackson et al. (199 1) Nature 352:624-628; and Barbas et al. 

10 (1992) PNAS 89:4457 4461, the disclosures of which are incorporated herein by reference in their 
entireties). In an illustrative embodiment, the recombinant phage antibody system (RPAS, 
Pharmacia Catalog number 27-9400-01) can be easily modified for use in expressing Pro412- 
related proteins combinatorial libraries, and the Pro412-rclated proteins phage library can be 
panned on immobilized Pro412-related target molecule (glutathione immobilized Pro412-related 

15 largct-GST fusion proteins or immobilized DNA). Successive rounds of phage amplification and 
panning can greatly enrich for Pro412-related proteins homologs which retain an ability to bind a 
Pro412-related target and which can subsequently be screened further for biological activities in 
automated assays, in order to distinguish between agonists and antagonists. 

The invention also provides for identification and reduction to functional minimal size of 

20 the Pro412-rclatcd proteins functional domains, to generate mimetics, e.g. peptide or non-peptide 
agents, which are able to disrupt binding of a polypeptide of the present invention with a Pro412- 
rclatcd target molecule (protein or DNA). Thus, such mutagenic techniques as described above are 
also useful to map the determinants of Pro412-related proteins which participate in protein-protein 
or protein-DNA interactions involved in, for example, binding to a Pro412-related target protein or 

25 DNA. To illustrate, the critical residues of a Pro412-related protein which are involved in 

molecular recognition of the Pro412-related target can be determined and used to generate Pro412- 
related target- 1 3P-derived pcplidomimetics that competitively inhibit binding of the Pro412-related 
protein to the Pro412-related target. By employing, for example, scanning mutagenesis to map the 
amino acid residues of a particular Pro412-rclated protein involved in binding a Pro412-related 

30 target, peptidomimetic compounds can be generated which mimic those residues in binding to a 
Pro412-related target, and which, by inhibiting binding of the Pro412-related protein to the 
Pro4 1 2-reIated target molecule, can interfere with the function of a Pro4 1 2-related protein in 
transcriptional regulation of one or more genes. For instance, non hydrolyzable peptide analogs of 
such residues can be generated using retro-inverse peptides (e.g., see U.S Patents 5,1 16,947 and 

35 5,219,089; and Pallai et al. (1983) Int J Pept Protein Res 21:84-92), benzodiazepine (e.g., see 
Freidingcr ct al. in Peptides: Chemistry and Biology, G.R. Marshall cd., ESCOM Publisher 
Leiden, Netherlands, 1988), aicpme (e.g., see Huffman et al. in Peptides.- Chemistry and Biology, 
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G.R, Marshall ed., ESCOM Publisher. Leiden, Netherlands, 1988), substituted gamma lactam rings 
(Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher. Leiden, 
Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; 
and Ewenson ct al. in Peptides: Structure and Function (Proceedings of the 9th American Peptide 

5 Symposium) Pierce Chemical Co. Rockland, IL, 1985), P-turn dipeptide cores (Nagai et al. (1985) 
Tetrahedron Left 26:647; and Sato ct al. (1986) J Chem Soc Pcrkin Trans 1: 123 I), and P- 
aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and Dann et al. 
(1986) Biochem Biophys Res Commun 134:71, the disclosures of which are incorporated herein by 
reference in their entireties). 

10 An isolated Pro4l2-relatcd protein, or a portion or fragment thereof, can be used as an 

immunogen to generate antibodies that bind Pro412-related proteins using standard techniques for 
polyclonal and monoclonal antibody preparation. A full-length Pro4l2-related protein can be used 
or, alternatively, the invention provides antigenic peptide fragments of Pro412-related proteins for 
use as immunogens. Any fragment of a Pro412-related protein which contains at least one 

1 5 antigenic determinant may be used to generate antibodies. The antigenic peptide of a Pro41 2- 
related protein comprises at least 6 amino acid residues of an amino acid sequence selected from 
the group consisting of SEQ ID NOS 1 to 4, and encompasses an epitope of a Pro412-related 
protein such that an antibody raised against the peptide forms a specific immune complex with a 
Pro412-related protein. Preferably, the antigenic peptide comprises at least 8 or 10 amino acid 

20 residues, more preferably at least 1 5 amino acid residues, even more preferably at least 20 amino 
acid residues, and most preferably at least 30 amino acid residues. 

Preferred epitopes encompassed by the antigenic peptide are regions of a Pro412-relatcd 
protein that are located on the surface of the protein, e.g., hydrophilic regions. 

25 A Pro4 12-related protein immunogen typically is used to prepare antibodies by 

immunizing a suitable subject, (e.g., rabbit, goat, mouse or other mammal) with the immunogen. 
An appropriate immunogenic preparation can contain, for example, rccombinantly expressed 
Pro412-rclatcd protein or a chemically synthesized Pro4 12-related protein. The preparation can 
further include an adjuvant, such as Freund's complete or incomplete adjuvant, or similar 

30 immunostimulatory agent Immunization of a suitable subject with an immunogenic Pro4 12-related 
protein preparation induces a polyclonal anti-Pro4 12-related antibody response. 

The invention concerns antibody compositions, either polyclonal or monoclonal, capable 
of selectively binding, or selectively bind to an epitopc-containing a polypeptide comprising a 
contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids. The invention 

35 also concerns a purified or isolated antibody capable of specifically binding to a mutated Pro4 12- 
relatcd protein or to a fragment or variant thereof comprising an epitope of the mutated Pro4 12- 
related protein. 
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Primers and probes 

PrimeTS and probes of the invention can be prepared by any suitable method, including, for 
example, cloning and restriction of appropriate sequences and direct chemical synthesis by a 
method such as the phosphodiester method of Narang SA et al (Methods Enzymol 1979;68:90-98). 
the phosphodiester method of Brown EL et al (Methods Enzymol 1979;68: 109-151), the 
dicthylphosphorarmditc method of Bcaucagc ct al (Tetrahedron Lett 1981, 22: 1859-1862) and the 
solid support method described in EP 0 707 592, the disclosures of which are incorporated herein 
by reference in their entireties. 

Detection probes are generally nucleic acid sequences or uncharged nucleic acid analogs 
such as, for example peptide nucleic acids which are disclosed in International Patent Application 
WO 92/20702, morpholino analogs which are described in U.S. Patents Numbered 5,185,444; 
5,034,506 and 5,142,047. If desired, the probe may be rendered "non-extendable" in that additional 
dNTPs cannot be added to the probe. In and of themselves analogs usually are non-extendable and 
nucleic acid probes can be rendered non-extendable by modifying the 3' end of the probe such that 
the hydroxyl group is no longer capable of participating in elongation. For example, the 3* end of 
the probe can be functionalized with the capture or detection label to thereby consume or otherwise 
block the hydroxyl group. 

Any of the polynucleotides of the present invention can be labeled, if desired, by 
incorporating any label known in the art to be detectable by spectroscopic, photochemical, 
biochemical, immunochemical, or chemical means. For example, useful labels include radioactive 
substances (including, ^P, 3 *S, *H, l2S I) f fluorescent dyes (including, 5-bromodesoxyuridin, 
fluorescein, acetylaminofluorene, digoxigenin) or biotin. Preferably, polynucleotides arc labeled 
at their 3* and 5* ends. Examples of non-radioactive labeling of nucleic acid fragments arc 
described in (Urdea et al. (Nucleic Acids Research. 1 1:4937-4957, 1988) or Sanchez-Pescador et 
al. (J Clin Microbiol. 26(10): 1934-1938, 1988). In addition, the probes according to the present 
invention may have structural characteristics such that they allow the signal amplification, such 
structural characteristics being, for example, branched DNA probes as those described by Urdea et 
al (Nucleic Acids Symp. Ser. 24:197-200, 1991) or in the European patent No. EP 0 225 807 
(Chiron). 

A label can also be used to capture the primer, so as to facilitate the immobilization of cither 
the primer or a primer extension product, such as amplified DNA, on a solid support A capture 
label is attached to the primers or probes and can be a specific binding member which forms a 
binding pair with the solid's phase reagent's specific binding member (e.g. biotin and streptavidin). 
Theiefore depending upon the type of label carried by a polynucleotide or a probe, it may be 
employed to capture or to detect the target DNA. Further, it will be understood that the 
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polynucleotides, primers or probes provided herein, may, themselves, serve as the capture label. 
For example, in the case where a solid phase reagent's binding member is a nucleic acid sequence, 
it may be selected such that it binds a complementary portion of a primer or probe to thereby 
immobilize the primer or probe to the solid phase. In cases where a polynucleotide probe itself 
serves as the binding member, those skilled in the art will recognize that the probe will contain a 
sequence or "tail" that is not complementary to the target. In the case where a polynucleotide 
primer itself serves as the capture label, at least a portion of the primer will be free to hybridize 
with a nucleic acid on a solid phase. DMA labeling techniques are well known to the skilled 
technician. 

The probes of the present invention are useful for a number of purposes. They can be 
notably used in Southern hybridization to genomic DNA. The probes can also be used to detect 
PGR amplification products. They may also be used to detect mismatches in the Pro412-reIated 
gene or a Pro412-related proteins mRNA using other techniques. 

Any of the nucleic acids, polynucleotides, primers and probes of the present invention can 
be conveniently immobilized on a solid support Solid supports are known to those skilled in the 
art and include the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, 
nitrocellulose strips, membranes, microparticles such as latex particles, sheep (or other animal) red 
blood cells, duracyles and others. The solid support is not critical and can be selected by one 
skilled in the art Thus, latex particles, microparticles, magnetic or non-magnetic beads, 
membranes, plastic tubes, walls of microtitcr wells, glass or silicon chips, sheep (or other suitable 
animal's) red blood cells and duracytes are all suitable examples. Suitable methods for 
immobilizing nucleic acids on solid phases include ionic, hydrophobic, covalent interactions and 
the Jike. A solid support, as used herein, refers lo any material which is insoluble, or can be made 
insoluble by a subsequent reaction. The solid support can be chosen for its intrinsic ability to 
attract and immobilize the capture reagent Alternatively, the solid phase can retain an additional 
receptor which has the ability to attract and immobilize the capture reagent The additional 
receptor can include a charged substance that is oppositely charged with respect to the capture 
reagent itself or to a charged substance conjugated to the capture reagent As yet another 
alternative, the receptor molecule can be any specific binding member which is immobilized upon 
(attached to) the solid support and which has the ability to immobilize the capture reagent through 
a specific binding reaction. The receptor molecule enables the indirect binding of the capture 
reagent to a solid support material before the performance of the assay or during the performance 
of the assay. The solid phase thus can be a plastic, derivatized plastic, magnetic or non-magnetic 
metal, glass or silicon surface of a test tube, microliter well, sheet, bead, microparticle, chip, sheep 
(or other suitable animal's) red blood cells, duracytes and other configurations known lo those of 
ordinary skill in the art. The nucleic acids, polynucleotides, primers and probes of the invention 
can be attached to or immobilized on a solid support individually or in groups of at least 2, 5, 8, 10, 
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12, 15, 20, or 25 distinct polynucleotides of the invention to a single solid support. In addition, 
polynucleotides other than those of the invention may be attached to the same solid support as one 
or more polynucleotides of the invention. 

Any polynucleotide provided herein may be attached in overlapping areas or at random 
5 locations on a solid support. Alternatively the polynucleotides of the invention may be attached in 
an ordered array wherein each polynucleotide is attached to a distinct region of the solid support 
which does not overlap with the attachment site of any other polynucleotide. Preferably, such an 
ordered array of polynucleotides is designed to be "addressable" where the distinct locations are 
recorded and can be accessed as part of an assay procedure. Addressable polynucleotide arrays 

10 typically comprise a plurality of different oligonucleotide probes that are coupled to a surface of a 
substrate in different known locations. The knowledge of the precise location of each 
polynucleotides location makes these "addressable" arrays particularly useful in hybridization 
assays. Any addressable array technology known in the art can be employed with the 
polynucleotides of the invention. One particular embodiment of these polynucleotide arrays is 

15 known as the Genechips, and has been generally described in US Patent 5,143,854; PCT 

publications WO 90/15070 and 92/10092, the disclosures of which are incorporated herein by 
reference in their entireties. 

Chemical Manufacture ofPro412-related Compositions 

Peptides of the invention are synthesized by standard techniques, e.g. Stewart and Young, 
Solid Phase Peptide Synthesis, 2nd Ed. (Pierce Chemical Company, Rockford, 1L, 1984). 
Preferably, a commercial peptide synthesizer is used, e.g. Applied Biosystems, Inc. (Foster City. 
CA) model 430A, and polypeptides of the invention may be assembled from multiple, separately 
synthesized and purified, peptide in a convergent synthesis approach, e.g. Kent et al, U.S. patent 
6,184,344 and Dawson and Kent, Annu. Rev. Biochem., 69: 923-960 (2000). Peptides of the 
invention may be assembled by solid phase synthesis on a cross-linked polystyrene support starting 
from the carboxyl terminal residue and adding amino acids in a stepwise fashion until the entire 
peptide has been formed. The following references are guides to the chemistry employed during 
synthesis: Schnolzerct al, Int. J. Peptide Protein Res., 40: 180-193 (1992); Mcrrificld, J. Amcr. 
Chcm. Soc. Vol. 85. pg. 2149 (1963); Kent et al., pg 185, in Peptides 1984, Ragnareson, Rd 
(Almquist and Wcksell, Stockholm, 1984); Kent et al.. pg. 21 7 in Peptide Chemistry 84, Izumiya, 
Ed. (Protein Research Foundation, B.H. Osaka, 1985); Mcrrificld, Science, Vol. 232, pgs. 341-347 
(1986); Kent, Ann. Rev. Biochem.. Vol. 57. pgs. 957-989 (1988), and references cited in these 
latter two references. 

Preferably, chemical synthesis of polypeptides of the invention is carried out by the 
assembly of peptide fragments by native chemical ligation, as described by Dawson et al. Science, 
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266: 776-779 (1994) and Kent el al, U.S. patent 6,184,344. Briefly, in the approach a first peptide 
fragment is provided with an N-terminal cysteine having an unoxidized sulfhydryl side chain, and a 
second peptide fragment is provided with a C-terminal thioestcr. The unoxidized sulfhydryl side 
chain of the N-term inal cysteine is then condensed with the C-terminal thioester to produce an 
intermediate peptide fragment which links the first and second peptide fragments with a P- 
aminothioester bond. The p-aminothioester bond of the intermediate peptide fragment then 
undergoes an intramolecular rearrangement to produce the peptide fragment product which links 
the first and second peptide fragments with an amide bond. Preferably, the N-terrainal cysteine of 
interna] fragments are protected from undesired cyclization and/ro concatenation reactions by a 
cyclic thiazolidine protecting group as described below. Preferably, such cyclic thiazolidine 
protecting group is a thioprolinyl group. 

Peptide fragments having a C-terminal thioester may be produced as described in the 
following references, which are incorporated by reference: Kent ct al, U.S. patent 6,1 84,344; Tarn 
ct al, Proc. Natl. Acad. Sci., 92: 12485-12489 (1995); Blake, Int J. Peptide Protein Res., 17- 273 
(1981); Canne et al, Tetrahedron Letters, 36: 1217-1220 (1995); Hackeng et al, Proc. Natl. Acad. 
Sci., 94: 7845-7850 (1997); or Hackeng ct al, Proc. Natl. Acad. Sci., 96: 10068-10073 (1999). 
Preferably, the method described by Hackeng et al (1999) is employed. Briefly, peptide fragments 
are synthesized on a solid phase support (described below) typically on a 0.25 mmo! scale by using 
the in situ neutralization/HBTU activation procedure for Boc chemistry dislosed by Schnolzer et 
al, Int. J. Peptide Protein Res., 40: 180-193 (1992), which reference is incorporated herein by 
reference. (HBTU is 2-(lH-benzotria2ol-l-yl)-l,13 > 3-tetramethyluronium hexafluorophosphate 
and Boc is tert-butoxycarbonyl). Each synthetic cycle consists of N*-Boc removal by a 1- to 2- 
minute treatment with neat TFA, a 1 -minute DMF flow wash, a 10- to 20-minute coupling time 
with 1.0 mmot of prcaclivated Boc-amino acid in the presence of DIEA, and a second DMF flow 
wash. (TFA is trifluoroacetic acid, DMF is N,N-dimcthylformamide, and DIEA is NJ4- 
diisopropylcthylamine). N a -Boc-amino acids (U mmol) are preactivatcd for 3 minutes with 1.0 
mmol of HBTU (0.5 M in DMF) in the presence of excess DIEA (3 mmol). After each coupling 
step, yields are determined by measuring residual free amine with a conventional quantitative 
ninhydrin assay, e.g. as disclosed in Sarin ctal, Anal. Biochem., 117: 147-157(1981). After 
coupling of Gin residues, a DCM flow wash is used before and after deprotection by using TFA, to 
prevent possible high-temp erarurc (TFA/DMF)-catalyzcd pyrroiidone formation. After chain 
assembly is completed, the peptide fragments arc deprotectcd and cleaved from the resin by 
treatment with anhydrous HF for I hour at 0°C with 4%p-cresol as a scavenger. The imidazole 
side-chain 2,4-dinitrophcnyl (dnp) protecting groups remain on the His residues because the dnp- 
removat procedure is incompatible with C-terminal thioestcr groups. However, dnp is gradually 
removed by thiols during the ligation reaction. After cleavage, peptide fragments arc precipitated 
with ice-cold diethylethcr, dissolved in aqueous acctomtrilc, and lyophilized. 
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Thioestcr peptide fragments described above are preferably synthesized on a trityl- 
associated mercaptopropionic acid-leucine (TAMPAL) resin, made as disclosed by Hackeng et al 
(1999), or comparable protocol. Briefly, N°-Boc-Lcu (4 mmol) is activated with 3.6 mmol of 
HBTU in the presence of 6 mmol of DIEA and coupled for 16 minutes lo 2 mmol of p- 

5 methylbenzhydrylamine (MBHA) resin, or the equivalent Next, 3 mmol of S-trityl 

mercaptopropionic acid is activated with 2.7 mmol of HBTU in the presence of 6 mmol of DTEA 
and coupled for 1 6 minutes to Leu-MBHA resin. The resulting TAMPAL resin can be used as a 
starting resin for polypeptide-chain assembly after removal of the trityl protecting group with two 
1 -minute treatments with 3.5% triisopropylsiiane and 2.5% H 2 0 in TFA. The tbioester bond can 

10 be formed with any desired amino acid by using standard in situ-ncutralization peptide coupling 
protocols for 1 hour, as disclosed in Schnolzer et al (cited above). Treatment of the final peptide 
fragment with anhydrous HF yields the C-terminal activated mercaptopropionic acid-leucine 
(MPAL) thioester peptide fragments. 

Preferably, thiazolidine-protected thioestcr peptide fragment intermediates arc used in 

15 native chemical ligation under conditions as described by Hackeng et al (1999), or like conditions. 
Briefly, 0.1 M phosphate buffer (pH 8.5) containing 6 M guanidine, 4% (vol/vol) 
benzylrnercaptan, and 4% (vol/vol) thiophenol is added to dry peptides to be ligated, to give a final 
peptide concentration of 1-3 mM at about pH 7, lowered because of the addition of thiols and TFA 
from the lyophilized peptide. Preferably, the ligation reaction is performed in a heating block at 

20 37°C and is periodically vortexed to equilibrate the thiol additives. The reaction may be monitored 
for degree of completion by MALDI-MS or HPLC and electrospray ionization MS. 

After a native chemical ligation reaction is completed or stopped, the N-terminal 
thiazolidine ring of the product is opened by treatment with a cysteine deprotccting agent, such as 
Omcthylhydroxylamine (0.5 M) at pH 3.5-4.5 for 2 hours at 37°C , after which a 10-fold excess of 

25 Tris-(2-carboxyethyl)-phosphine is added to the reaction mixture to completely reduce any 

oxidizing reaction constituents prior to purification of the product by conventional preparative 
HPLC. Preferably, fractions containing the ligation product arc identified by electrospray MS, are 
pooled, and lyophilized. 

After the synthesis is completed and the final product purified, the final polypeptide 

30 product may be refolded by conventional techniques, e.g. Crcighton, Mcth. Enzymol., 107: 305- 
329 (1984); White, Mcth. Enzymol., 11: 481-484 (1967); WeUaufcr, Meth. Enzymol., 107: 301- 
304 (1984), and the like Preferably, a final product is refolded by air oxidation by the following, 
or like: The reduced lyophilized product is dissolved (at about 0.1 mg/mL) in 1 M guanidine 
hydrochloride (or like chaolropic agent) with 100 mM Tris, 10 mM methionine, at pH 8.6. After 




35 gentle overnight stirring, the re-folded product is isolated by reverse phase HPLC with 
conventional protocols. 
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Recombinant Expression Vectors and Host Cells 

The polynucleotide sequences described herein can be used in recombinant DNA 
molecules that direct the expression of the corresponding polypeptides in appropriate host cells. 
Because of the degeneracy in the genetic code, other DNA sequences may encode the equivalent 
amino acid sequence, and may be used to clone and express the Pro412-related proteins. Codons 
preferred by a particular host cell may be selected and substituted into the naturally occurring 
nucleotide sequences, to increase the rate and/or efficiency of expression. The nucleic acid (e.g., 
cDNA or genomic DNA) encoding the desired Pro4i2-relatcd protein may be inserted into a 
rcplicablc vector for cloning (amplification of the DNA), or for expression. The polypeptide can be 
expressed recombinantly in any of a number of expression systems according to methods known in 
the art (Ausubel, et al., editors, Current Protocols in Molecular Biology, John Wiley & Sons, New 
York, 1990). Appropriate host cells include yeast, bacteria, archebacteria, fungi, and insect and 
animal cells, including mammalian cells, for example primary cells, including stem cells, 
including, but not limited lo bone marrow stem cells. More specifically, these include, but are not 
limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid 
or cosmid DNA expression vectors, and yeast transformed with yeast expression vectors. Also 
included, are insect cells infected with a recombinant insect virus (such as baculovirus), and 
mammalian expression systems. The nucleic acid sequence to be expressed may be inserted into 
Ihe vector by a variety of procedures. In general, DNA is inserted into an appropriate restriction 
endonuclease site using techniques known in the art. Veclor components generally include, but are 
not limited to, one or more of a signal sequence, an origin of replication, one or more marker 
genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of 
suitable vectors containing one or more of these components employs standard ligation techniques 
which are known to the skilled artisan. 

The Pro4 1 2-related proteins of the present invention are produced by culturing a host cell 
transformed with an expression vector containing a nucleic acid encoding a Pro4 1 2-related 
proteins, under the appropriate conditions to induce or cause expression of the protein. The 
conditions appropriate for Pro412-related proteins expression will vary with the choice of the 
expression vector and the host cell, and will be easily ascertained by one skilled in the art through 
routine experimentation. For example, the use of constitutive promoters in the expression vector 
will require optimizing the growth and proliferation of the host cell, while the use of an inducible 
promoter requires the appropriate growth conditions for induction. In addition, in some 
embodiments, the timing of the harvest is important For example, the baculoviral systems used in 
insect cell expression are lytic viruses, and thus harvest time selection can be crucial for product 
yield. 

A host cell strain may be chosen for its ability to modulate the expression of the inserted 
sequences or to process the expressed protein in the desired fashion. Such modifications of the 
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protein include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
Hpidation and acylation. Post-translational processing, which cleaves a "prcpro" form of the 
protein, may also be important for correct insertion, folding and/or function. By way of example, 
host cells such as CHO, HeLa, BHK, MDCK, 293, W138, etc. have specific cellular machinery 
5 and characteristic mechanisms for such post-translational activities and may be chosen to ensure 
the correct modification and processing of the introduced, foreign protein. Of particular interest arc 
Drosophila melangastev cells, Sacckavomyces cevevisiae and other yeasts, E. coli. Bacillus 
subtihs, SF9 cells, C129 cells, 293 ceils, Neurospora, BHK, CHO, COS, and HeLa cells, 
fibroblasts, Schwanoma cell lines, immortalized mammalian myeloid and lymphoid cell lines, 
10 Jukat cells, human cells and other primary cells. 

The nucleic acid encoding a Pro4l2-related proteins must be "operably linked" by placing 
it into a functional relationship with another nucleic acid sequence. For example, DNA for a 
presentence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a 
preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably 
15 linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding 
site is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" DNA sequences are contiguous, and, in the case of a secretory leader, 
contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic 
20 oligonucleotide adaptors or linkers arc used in accordance with conventional practice. Promoter 
sequences encode either constitutive or inducible promoters. The promoters may be cither 
naturally occurring promoters or hybrid promoters. Hybrid promoters, which combine elements of 
more than one promoter, are also known in the art, and are useful in the present invention. The 
expression vector may comprise additional elements, for example, the expression vector may have 
25 two replication systems, thus allowing it to be maintained in two organisms, for example in 

mammalian or insect cells for expression and in a procaryotic host for cloning and amplification. 
Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to 
replicate in one or more selected host cells. Such sequences are well known for a variety of 
bacteria, yeast, and viruses. The origin of replication from the plasmid pBR322 is suitable for most 
30 Gram-negative bacteria, the 2: plasmid origin is suitable for yeast, and various viral origins (SV40, 
polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in mammalian cells. Further, for 
integrating expression vectors, the expression vector contains at least one sequence homologous to 
the host cell genome, and preferably, two homologous sequences which flank the expression 
construct. The integrating vector may be directed to a specific locus in the host cell by selecting 
35 the appropriate homologous sequence for inclusion in the vector. Constructs for integrating vectors 
are well known m the art 

Preferably, the expression vector contains a selectable marker gene to allow the selection 
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of transformed host cells. Selection genes are well known in the art and will vary with the host 
cell used. Expression and cloning vectors will typically contain a selection gene, also termed a 
selectable marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics 
or other toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement 

5 auxotrophic deficiencies, or (c) supply critical nutrients not available for from complex media, 
e.g., the gene encoding D-alanine racemase for Bacilli. 

Host cells transformed with a nucleotide sequence encoding a Pro412HPclated protein may 
be cultured under conditions suitable for the expression and recovery of the encoded protein from 
cell culture. The protein produced by a recombinant cell may be secreted, membrane-bound, or 

10 contained intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides encoding the Pro4 1 2- 
rclated proteins can be designed with signal sequences which direct secretion of the Pro4I2-related 
proteins through a prokaryotic or eukaryotic cell membrane. The desired Pro4 12-related protein 
may be produced rccombinantly not only directly, but also as a fusion polypeptide with a 

15 heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific 
cleavage site at the N-tcrminus of the mature protein or polypeptide. In general, the signal 
sequence may be a component of the vector, or it may be a part of the Pro4 1 2-related proteins- 
encoding DNA that is inserted into the vector. The signal sequence may be a prokaryotic signal 
sequence selected, for example, from the group of the alkaline phosphatase, penicillinase, Ipp, or 

20 heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, eg.* the yeast 
invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces a-factor 
leaders, the latter described in U.S. Pat No. 5,010,182), or acid phosphatase leader, the C albicans 
glucoamylasc leader (EP 362,179 published Apr. 4, 1990), or the signal described in WO 
901 13646 published Nov. 15, 1990. In mammalian cell expression, mammalian signal sequences 

25 may be used to direct secretion of the protein, such as signal sequences from secreted polypeptides 
of the same or related species, as well as viral secretory leaders. According to the expression 
system selected, the coding sequence is inserted into an appropriate vector, which in turn may 
require the presence of certain characteristic "control elements" or "regulatory sequences." 
Appropriate constructs are known generally in the art (Ausubcl, ct al., 1990) and, in many cases, 

30 arc available from commercial suppliers such as Invitrogcn (San Diego, Calif.), Stratagenc (La 
Jolia, Calif.), Gibco BRL (Rockville, Md.) or Clontech (Palo Alto, Calif). 



Expression in Bacterial Systems. 

Transformation of bacterial cells may be achieved using an inducible promoter such as the 
35 hybrid lacZ promoter of the "BLUESCRIPT" Phagcmid (Stratagene) or 4 *pSPORTl" (Gibco 
BRL). In addition, a number of expression vectors may be selected for use in bacterial cells to 
produce cleavablc fusion proteins that can be easily detected and/or purified, including, but not 
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limited to "BLUESCRIPT" (a-galactosidase; Stratagene) or pGEX (glutathione S-transferase; 
Promega, Madison, Wis.). A suitable bacterial promoter is any nucleic acid sequence capable of 
binding bacterial RNA polymerase and initiating the downstream (3') transcription of the coding 
sequence of the Pro412-related gene into mRNA. A bacterial promoter has a transcription initiation 

5 region which is usually placed proximal to the 5* end of the coding sequence. This transcription 
initiation region typically includes an RNA polymerase binding site and a transcription initiation 
site. Sequences encoding metabolic pathway enzymes provide particularly useful promoter 
sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, such 
as galactose, lactose and maltose, and sequences derived from biosynthetic enzymes such as 

10 tryptophan. Promoters from bacteriophage may also be used and are known in the art. In addition, 
synthetic promoters and hybrid promoters are also useful; for example, the tat promoter is a hybrid 
of the trp and lac promoter sequences. Furthennorc, a bacterial promoter can include naturally 
occurring promoters of non-bactcrial origin that have the ability to bind bacterial RNA polymerase 
and initiate transcription. An efficient ribosome binding site is also desirable. The expression 

1 5 vector may also include a signal peptide sequence that provides for secretion of the Pro4 12-rclatcd 
protein in bacteria. The signal sequence typically encodes a signal peptide comprised of 
hydrophobic amino acids which direct the secretion of the protein from die cell, as is well known 
in the art The protein is cither secreted into the growth media (gram-positive bacteria) or into the 
periplasmic space, located between the inner and outer membrane of the cell (gram-negative 

20 bacteria) The bacterial expression vector may also include a selectable marker gene to allow for 
the selection of bacterial strains that have been transformed. Suitable selection genes include drug 
resistance genes such as ampicillin, chloramphenicol, erythromycin, kanamycin, neomycin and 
tetracycline. Selectable markers also include biosynthetic genes, such as those in the histidinc, 
tryptophan and leucine biosynthetic pathways. When large quantities of Pro412-rclatcd proteins 

25 arc needed, e.g., for the induction of antibodies, vectors which direct high level expression of 

fusion proteins that are readily purified may be desirable. Such vectors include, but are not limited 
to, multifunctional E. coli cloning and expression vectors such as BLUESCRJPT (Stratagene), in 
which the Pro4 1 2-rclated protein coding sequence may be ligatcd into the vector in-frame with 
sequences for the armno-terminal Met and the subsequent 7 residues of beta-galactosidase so that a 

30 hybrid protein is produced; PIN vectors [Van Heeke & Schuster JBiol Chem 264:5503-5509 
1989)]; PET vectors (Novagen, Madison Wis.); and the like. Expression vectors for bacteria 
include the various components set forth above, and are well known in the art. Examples include 
vectors for Bacillus subtil is, & coll. Streptococcus cvemovis* and Streptococcus hvldans, among 
others. Bacterial expression vectors are transformed into bacterial host cells using techniques well 

35 known in the art, such as calcium chloride mediated transection, electroporation, and others. 
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Expression in YeasL 

Yeast expression systems are well known in the art, and include expression vectors For 
Sacchavomyces cevevisiae. Candida albicans and C. maltosa. Hansenula potymovpha. 
Ktuyvevomyces foagilis and K. tact is, Pichia guillevimondii and Ppastoris t Sehizosaccha-vomyces 
pombe, and Yawowia lipolytics Examples of suitable promoters for use in yeast hosts include the 
promoters for 3-phosphoglyccratc kinase [Hitzcman et aL,/. Biol. Chem. 255:2073 (1980)] or 
other glycolytic enzymes [Hess et al. JL Adv. Enzyme Reg. 7: 1 49 ( 1 968); Holland, Biochemistry 
17:4900 (1978)], such as cnolasc, gIyccraldchydc-3- phosphate dehydrogenase, hexokinase, 
pyruvate decarboxylase, phosphofructokinase, glucose- 6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, tri osephosphate isomerase, phosphoglucosc isomerase, 
alpha factor, the ADH2IGAPDH promoter, glucokinase alcohol oxidase, and PGR [See, for 
example, Ausubel, et ah, 1990; Grant et al., Methods in Enzymology 153:516-544, (1987)]. Other 
yeast promoters, which are inducible have the additional advantage of transcription controlled by 
growth conditions, include the promoter regions for alcohol dehydrogenase 2, isocytochrome C, 
acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, 
glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose 
utilization. Suitable vectors andpromotcrs for use in yeast expression arc further described in EP 
73,657. Yeast selectable markers include ADE2. HIS4. LEU2. TRP1. and ALG7, which confers 
resistance'to runicamycin; the neomycin phosphotransferase gene, which confers resistance to 
G418; and the CUP I gene, which allows yeast to grow in the presence of copper ions. Yeast 
expression vectors can be constructed for intracellular production or secretion of a Pro412-tclatcd 
proteins from the DNA encoding the Pro412-related proteins of interest. For example, a selected 
signal peptide and the appropriate constitutive or inducible promoter may be inserted into suitable 
restriction sites in the selected plasmid for direct intracellular expression of the Pro412-related 
protein. For secretion of the Pro4!2-related proteins, DNA encoding the Pro412-related protein can 
be cloned into the selected plasmid, together with DNA encoding the promoter, the yeast alpha- 
factor secretory signal/leader sequence, and linker sequences (as needed), for expression of the 
Pro412-related protein. Yeast cells, can then be transformed with the expression plasmids 
described above, and cultured in an appropriate fermentation media. The protein produced by such 
transformed yeast can then be concentrated by precipitation with 10% trichloroacetic acid and 
analyzed following separation by SDS-PAGE and staining of the gels with Coomassie Blue stain. 
The recombinant Pro4 12-i elated proteins can subsequently be isolated and purified from the 
fermentation medium by techniques known to those of skill in the art. 

Expression in Mammalian Systems. 

The Pro412-related proteins may be expressed in mammalian cells. Mammalian expression 
systems are known in the art, and include retroviral vector mediated expression systems. 
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Mammalian host cells may be transformed with any of a number of different viral-based 
expression systems, such as adenovirus, where the coding region can be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader sequence. 
Insertion in a nonessential El or E3 region of the viral genome results in a viable vims capable of 

5 expression of the polypeptide of interest in infected host cells. A preferred expression vector 
system is a retroviral vector system such as is generally described in PCT/US97/0I019 and 
PCT/US97/101048. Suitable mammalian expression vectors contain a mammalian promoter which 
is any DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence for Pro4i2-relaled protein into mRNA. A 

10 promoter will have a transcription initiating region, which is usually placed proximal to the 5* end 
of the coding sequence, and a TATA box, using a located 25-30 base pairs upstream of the 
transcription initiation site. The TATA box is thought to direct RNA polymerase II to begin RNA 
synthesis at the correct site. A mammalian promoter will also contain an upstream promoter 
element (enhancer element), typically located within 100 to 200 base pairs upstream of the TATA 

1 5 box. An upstream promoter element determines the rate at which transcription is initiated and can 
act in either orientation Of particular use as mammalian promoters are the promoters from 
mammalian viral genes, since the viral genes are often highly expressed and have a broad host 
range. Examples include promoters obtained from the genomes of viruses such as polyoma virus, 
fowlpox virus (UK 2,21 1, 504 published Jul. 5,1989), adenovirus (such as Adenovirus 2), bovine 

20 papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian 
Virus 40 (SV40), from heterologous mammalian promoters, e.g., the actin promoter or an 
immunoglobulin promoter, and from heat-shock promoters, provided such promoters are 
compatible with the host cell systems. Transcription of a DNA encoding a Pro4l2-relatcd protein 
by higher cukaryotcs may be increased by inserting an enhancer sequence into the vector. 

25 Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp, that act on a promoter 
to increase its transcription. Many enhancer sequences are now known from mammalian genes 
(globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an 
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer, the cytomegalovirus 
early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and 

30 adenovirus enhancers. The enhancer is preferably located at a site 5' from the promoter. In general, 
the transcription termination and polyadenylation sequences recognized by mammalian cells are 
regulatory regions located 3' to the translation stop codon and thus, together with the promoter 
elements, flank the coding sequence. The 3* terminus of the mature mRNA is formed by site- 
specific post-translational cleavage and polyadenylation. Examples of transcription terminator and 

35 polyadenylation signals include those derived from S V40 Long term, high-yield production of 
recombinant proteins can be effected in a stable expression system. Expression vectors which 
contain viral origins of replication or endogenous expression elements and a selectable marker 
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gene may be used for this purpose. Appropriate vectors containing selectable markers for use in 
mammalian cells are readily available commercially and are known to persons skilled in the art. 
Examples of such selectable markers include, but are not limited to herpes simplex virus thymi- 
dine kinase and adenine phosphoribosyltransfcrasc for use in tk- or hprt-ceUs, respectively. The 
5 methods of introducing exogenous nucleic acid into mammalian hosts, as well as other hosts, is 
well known in the art, and will vary with the host cell used. Techniques include dextran-mediated 
transfection, calcium phosphate precipitation, polybrenc mediated transfection, protoplast fusion, 
electroporation, viral infection, encapsulation of the polynucleotides) in liposomes, and direct 
microinjection of the DNA into nuclei. 

10 

Expression in Insect Cells. 

Pro4 12-related proteins may also be produced in insect cells. Expression vectors for the 
transformation of insect cells, and in particular, baculo virus-based expression vectors, are well 
known m the art. In one such system, the Pro4 12-related proteins-encoding DNA is fused upstream 

15 of an epitope tag contained within a baculovinis expression vector. Autographa califovnica nuclear 
polyhcdrosis virus (AcNPV) is used as a vector to express foreign genes in Spodoptevu fmgipevdu 
Sf9 cells or in Trichoplusia larvae. The Pro412-rclatcd proteins-encoding sequence is cloned into a 
nonessential region of the virus, such as the polyhedrin gene, and placed under control of the 
polyhedrin promoter. Successful insertion of a Pro4 12-related proteins-encoding sequence will 

20 render the polyhedrin gene inactive and produce recombinant virus lacking coat protein coat. The 
recombinant viruses are then used to infect S. fmgipevdu cells or Trichoplusia larvae in which the 
Pro4 12-related proteins is expressed [Smith et al., J. Wol. 46:584 (1994); Engelhard E Ket aL, 
Pvoc. Nat. Acad Set 91:3224-3227 (1994)]. Suitable epitope tags for fusion to the Pro4 12-related 
proteins-encoding DNA include poly-his tags and immunoglobulin tags (like Fc regions of IgG). A 

25 variety of plasmids may be employed, including commercially available plasmids such as 

pVL1393 (Novagcn). Briefly, the Pro412-reIated proteins-encoding DNA or the desired portion of 
the Pro4 12-related proteins-encoding DNA is amplified by PCR with primers complementary to 
the 5* and 3* regions. The 5* primer may incorporate flanking restriction sites. The PCR product is 
then digested with the selected restriction enzymes and subcloned into an expression vector. 

30 Recombinant bacutovirus is generated by co-transfecting the above plasmid and BaculoGoloTM 
virus DNA (Pharmmgcn) into Spodopteva Jvugipevda ("Sf9") cells (ATCC CRL 17 1 1) using 
lipofectin (commercially available from CIBCO-BRL), or other methods known to those of skill in 
the art Vims is produced by day 4-5 of culture in Sf9 cells at 28°C. f and used for further 
amplifications. Procedures are performed as further described in O'Reilley et al.. BACULOVIRUS 

35 EXPRESSION VECTORS: A LABORATORY MANUAL. Oxford University Press (1994). Extracts 
may be prepared from recombinant virus-infected Sf9 cells as described in Rupert et al., Nature 
362: 175-179 (199%) Alternatively, expressed epitopc-tagged Pro4 12-related proteins can be 
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purified by affinity chromatography, or for example, purification of an IgG tagged (or Fc tagged) 
Pro412-related protein can be performed using chromatography techniques, including Protein A or 
protein G column chromatography. 

5 Evaluation of Gene Expression. 

Gene expression may be evaluated in a sample directly, for example, by standard 
techniques known to those of skill in the art, e.g.. Southern blotting for DNA detection. Northern 
blotting to determine the transcription of rnRNA, dot blotting (DNA or RNA), or in situ 
hybridization, using an appropriately labeled probe, based on the sequences provided herein. 

10 Alternatively, antibodies may be used in assays for detection of nucleic acids, such as specific 
duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA- 
protein duplexes. Such antibodies may be labeled and the assay carried out where the duplex is 
bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody 
bound to the duplex can be detected. Gene expression, alternatively, may be measured by 

15 immunohistochernical staining of cells or tissue sections and assay of cell culture or body fluids, to 
directly evaluate the expression ofPro4l2-related proteinss. Antibodies useful for such 
immunological assays may be either monoclonal or polyclonal, and may be prepared against a 
native sequence Pro4 12-related proteins based on the DNA sequences provided herein. 

20 Purification of Expressed Protein. 

Expressed Pro4 12-related proteins may be purified or isolated after expression, using any 
of a variety of methods known to those skilled in the art. The appropriate technique will vary 
depending upon what other components are present in the 

sample. Contaminant components that arc removed by isolation or purification are materials that 
25 would typically interfere with diagnostic or therapeutic uses for the polypeptide, and may include 
enzymes, hormones, and other solutes. The purification step(s) selected will depend, for example, 
on the nature of the production process used and the particular Pro4 12-related protein produced. A 
Pro4 1 2-rclated protein or protein may be recovered from culture medium or from host cell lysatcs. 
If membrane-bound, it can be released from the membrane using a suitable detergent solution (eg. 
30 Triton-X 100) or by enzymatic cleavage. Alternatively, cells employed in expression of Pro4l2- 
related proteins can be disrupted by various physical or chemical means, such as freeze-tliaw 
cycling, sonication, mechanical disruption, or by use of cell lysing agents. Exemplary purification 
methods include, but are not limited to, ion-exchange column chromatography; chromatography 
using silica gel or a cation-exchange resin such as DEAE; gel filtration using, for example, 
35 Sephadex G-75, protein A Sepharose columns to remove contaminants such as IgG; 

chromatography using metal chelating columns to bind epi tope-tagged forms of the Pro4 12-related 
protein; cthanol precipitation; reverse phase HPLC, chromatofocusing; SDS-PAGE; and 
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ammonium sulfate precipitation. Ordinarily, an isolated Pro4l2-related protein will be prepared by 
at least one purification step. For example, the Pro412-related protein may be purified using a 
standard anti-Pro412-related antibody column. Ultrafiltration and dialysis techniques, in 
conjunction with protein concentration, arc also useful (see, for example, Scopes, R., PROTEIN 
5 PURIFICATION, Springer-Verlag, New York, N.Y., 1982). The degree of purification necessary 
will vary depending on the use of the Pro412-rclatcd proteins. In some instances no purification 
will be necessary. Once expressed and purified as needed, the Pro412-related proteins and nucleic 
acids of the present invention are useful in a number of applications, as detailed below. 

10 Labeling of Expressed Protein. 

The nucleic acids, proteins and antibodies of the invention may be labeled. By labeled 
herein is meant that a compound has at least one element, isotope or chemical compound attached 
to enable the detection of the compound, hi general, labels fall into three classes: a) isotopic labels, 
which may be radioactive or heavy isotopes; b) immune labels, which may be antibodies or 

15 antigens; and c) colored or fluorescent dyes. The labels may be incorporated into the compound at 
any position that does not interfere with the biological activity or characteristic of the compound 
which is being detected. 

Pro412-related fusion Proteins. 

20 The Pro412-re!ated proteins of the present invention may also be modified in a way to 

form chimeric molecules comprising a Pro412-relatcd protein fused to another, heterologous 
polypeptide or amino acid sequence. The term "fusion protein" used herein includes but is not 
United to a chimeric polypeptide comprising a Prc412-related protein, or domain sequence thereof, 
fused to a 'targeting polypeptide". The targeting polypeptide has enough residues to facilitate 

25 targeting to a particular cell type or receptor, yet is short enough such that it does not interfere with 
the biological function of the Pro412-related protein. The targeting polypeptide preferably is also 
fairly unique so that the fusion protein does not substantially cross-react with other cell types or 
receptors. Suitable targeting polypeptides generally have at least about 10 amino acid residues and 
usually between from about 10 to about 500 amino acid residues. Preferred targeting polypeptides 

30 have from about 20 to about 200 amino acid residues. The fusion protein may also comprise a 
fusion of a Pro4l2-reIated protein with a tag polypeptide which provides an epitope to which an 
anti-tag antibody can selectively bind. The epitope tag is generally placed at the amino-or 
carboxyl-terrninus of the Pro412-relatcd proteins. Such epitope-tagged forms of a Pro412-related 
protein can be detected using an antibody against the tag polypeptide. Also, provision of the 

35 epitope tag enables a Pro412-rclatcd protein to be readily purified by using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. Alternatively, the fusion protein may 
comprise a fusion of a Pro4!2-related protein with an immunoglobulin or a particular region of an 
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immunoglobulin. For a bivalent form of the chimeric molecule, such a fusion could be to the Fc 
region of an IgG molecule or, for example, GM-CSF. Preferred fusion proteins include, but are not 
limited to, molecules that facilitate immune targeting of a Pro412-related protein. The Pro412- 
related fusion protein may be made for various other purposes using techniques well known in the 
5 art. For example, for the creation of antibodies, if the desired epitope is small, a partial or complete 
Pro4l2-related protein may be fused to a carrier protein to form an immunogen. Alternatively, the 
Pro412-rclated protein may be made as a fusion protein to increase the ability of the antigen to 
stimulate cellular and/or humoral (antibody-based) immune responses, or for other reasons. 

10 Synthetic Genes for Pro412-related proteins. 

Once nucleic acid sequence and/or amino acid sequence information is available for a 
native protein a variety of techniques become available for producing virtually any mutation in the 
native sequence, c.g.Shortle, in Science, Vol. 229, pgs. 1 193-1201 (1985); Zoller and Smith, 
Methods in Enzymology, Vol. 100, pgs. 468-500 (1983); Mark et al., U.S. Patent 4,518,584; Wells 

15 et al, in Gene, Vol. 34, pgs. 315-323 (1985); Estell et aL, Science, Vol. 233, pgs. 659-663 (1986); 
Mullcnbachet20 al., J. Biol.Chern., Vol.26l,pgs.719-722(1986) > andFerettietal.,ProcNatl. 
Acad. Sci., Vol. 83, pgs.. 597-603 (1986). Accordingly, these references arc incorporated by 
reference. 

Variants of the natural polypeptide (sometime referred to as "muteins") may be desirable 

20 in a variety of circumstances For example, undesirable side effects might be reduced by certain 
variants, particularly if the side effect activity is associated with a different part of the polypeptide 
from that of the desired activity. In some expression systems, the native polypeptide may be 
susceptible to degradation by proteases. In such cases, selected substitutions and/or deletions of 
amino acids which change the susceptible sequences can significantly enhance yields, e.g. British 

25 patent application 2 1 73-804- A where Arg at position 275 of human tissue plasminogen activator is 
replaced by Gly or Glu. Variants may also increase yields in purification procedures and/or 
increase shelf lives of proteins by eliminating amino acids susceptible to oxidation, acylation, 
alkylation, or other chemical modifications. For example, methionines readily undergo oxidation 
to form sulfoxides, which in many proteins is associated with loss of biological activity. e.g. Brot 

30 and Weissbach, Arch. Biochem. Biophys., Vol. 223, pg. 27 1 (1 983). Often methionines can be 
replaced by more inert amino acids with little or no loss of biological activity, e.g. Australian 
patent application AU-A-52451/86. in bacterial expression systems* yields can sometimes be 
increased by eliminating or replacing confonnationaily inessential cystiene residues, e.g. Mark et 
al., U.S. Patent 4,518,584. 

35 Preferably cassette mutagenesis is employed to generate mutant proteins. A synthetic gene 

is constructed with a sequence of unique (when inserted in an appropriate vector) restriction 
cndonucleasc sites spaced approximately uniformly along the gene The unique restriction sites 
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allow segments of the gene to be conveniently excised and replaced with synthetic 
oligonucleotides (i.e. "cassettes") which code for desired mutations. Determination of the number 
and distribution of unique restriction sites entails the consideration of several factors including (I) 
preexisting restriction sites in the vector to be employed in expression, (2) whether species or 
5 genera-specific codon usage is desired, (3) the number of different non-vector-cutting restriction 
cndonuclcascs available (and their multiplicities within the synthetic gene), and (4) the 
convenience and reliability of synthesizing and/or sequencing the segments between the unique 
restriction sites. 

The above technique is a convenient way to effect conservative amino acid substitutions, and the 

10 like, in the native protein sequence. "Conservative" as used herein means (i) that the alterations 
are as conformationally neutral as possible, that is, designed to produce minimal changes in the 
tertiary structure of the mutant polypeptides as compared to the native protein, and (ii) that the 
alterations are as antigcnicaily neutral as possible, that is, designed to produce minimal changes in 
the antigenic determinants of the mutant polypeptides as compared to the native protein. . The 

15 following is a preferred categorization of amino acids into similarity classes: aromatic (phe, trp, 
tyr), hydrophobic (leu, ile, val), polar (gin, asn), basic (arg, tys, his), acidic (asp, glu), small (ala, 
scr, thr, met, gly). Conformational neutrality is desirable for preserving biological activity, and 
antigenic neutrality is desirable for avoiding the triggering of immunogenic responses in patients 
or animals treated with the compounds of the invention. While it is difficult to select with absolute 

20 certainty which alternatives will be conformationally and antigenically neutral, rules exist which 
can guide those skilled in the art to make alterations that have high probabilities of being 
conformationally and antigcnicaily neutral, e.g. Anfisen (cited above); Bcrzofsky, Science, Vol. 
229, pgs. 932-940 (1985); and Bowie et al, Science, Vol. 247, pgs. 1306-1310 (1990). Some of the 
more important rules include (1) substitution of hydrophobic residues are less likely to produce 

25 changes in antigenicity because they are likely to be located in the protein's interior, e.g. Berzofsky 
(cited above) and Bowie et al (cited above); (2) substitution of physiochcmically similar, i.e. 
synonymous, residues are less likely to produce conformational changes because the replacement 
amino acid can play the same structural role as the substituted amino acid; and (3) alteration of 
evolutionary ly conserved sequences is likely to produce deleterious conformational effects because 

30 evolutionary conservation suggests sequences may be functionally important In addition to such 
basic rules for selecting variant sequences, assays are available to confirm the biological activity 
and conformation of the engineered molecules. Biological assays for the polypeptides of the 
invention are described more fully above. Changes in conformation can be tested by at least two 
well known assays: the microcomplcmcnt fixation method, c g Wasscrman etal., J. Immunol , 

35 Vol. 87, pgs. 290-295 (1961), or Levine et al. Methods in Enzymology, Vol. 1 1, pgs. 928-936 
(1967) used widely in evolutionary studies of the tertiary structures of proteins; and affinities to 
sets of conformation-specific monoclonal antibodies, e.g. Lewis et al., Biochemistry, Vol. 22, pgs. 
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948-954(1983). 

Transgenic animals 

The host cells of the invention can also be used to produce nonhuman transgenic animals. For 

5 example, in one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic 
stem ceil into which Pro412-rclatcd proteins-coding sequences have been introduced. Such host 
cells can then be used to create non-human transgenic animals in which exogenous Pro412-related 
proteins sequences have been introduced into their genome or homologous recombinant animals in 
which endogenous Prc412-related proteins sequences have been altered. Such animals arc useful 

10 for studying the function and/or activity of a Pro412-rclatcd protein or fragment thereof and for 
identifying and/or evaluating modulators of Pro412-related activity. As used herein, a "transgenic 
animal 11 is a non-human animal, preferably a mammal, more preferably a rodent such as a rat or 
mouse, in which one or more of the cells of the animal includes a transgene. Other examples of 
transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, 

15 etc. A transgene is exogenous DNA which is integrated into the genome of a cell from which a 
transgenic animal develops and which remains in the genome of the mature animal, thereby 
directing the expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, a "homologous recombinant animal" is a non-human animal, 
preferably a mammal, more preferably a mouse, in which an endogenous Pro4 12-rclated gene has 

20 been altered by homologous recombination between the endogenous gene and an exogenous DNA 
molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to 
development of the animal. 

A transgenic animal of the invention can be created by introducing a Pro412-reIated 
proteins-encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g,, by 

25 microinjection or retroviral infection, and allowing the oocyte to develop in a pseudopregnant 
female foster animal. The Pro412-rclatcd proteins cDNA sequence or a fragment thereof can be 
introduced as a transgene into the genome of a non-human animal. Alternatively, a nonhuman 
homologue of a huma Pro412-related gene, such as a mouse or rat Pro412-related gene, can be 
used as a transgene. Intronic sequences and polyadenylation signals can also be included in the 

30 transgene to increase the efficiency of expression of the transgene. A tissue-specific regulatory 

sequencers) can be operably linked to a Pro4l2-related proteins transgene to direct expression of a 
Pro4l2-relatcd protein to particular cells. Methods for generating transgenic animals via embryo 
manipulation and microinjection, particularly animals such as mice, have become conventional in 
the art and are described, for example, in U.S. Pat. Nos. 4,736,866 and 4,870,009, both by Leder et 

35 a)., U.S. Pat. No. 4,873,191 by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo, 
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986, the disclosure of which is 
incorporated herein by reference in its entirety). Similar methods arc used for production of other 
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transgenic animals. A transgenic founder animal can be identified based upon the presence of a 
Pro412-rclatcd proteins transgene in its genome and/or expression of Pro412-rclated proteins 
rnRNA in tissues or cells of the animals. A transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, transgenic animals carrying a transgene 
5 encoding a Pro412-related protein can further be bred to other transgenic animals carrying other 
transgenes. 

To create an animal in which a desired nucleic acid has been introduced into the genome 
via homologous recombination, a vector is prepared which contains at least a portion of a Pro412- 
rclated gene into which a deletion, addition or substitution has been introduced to thereby alter, 

10 e.g., functionally disrupt, the Pro4 12-related gene. The Pro412-rclatcd gene can be a human gene, 
but more preferably, is a non-human homologue of a huma Pro4 12-related gene (e.g., a cDNA 
isolated by stringent hybridization with a nucleotide sequence coding for a Pro4 12-related protein). 
For example, a mouse Pro412-rclatcd gene can be used to construct a homologous recombination 
vector suitable for altering an endogenous gene in the mouse genome. In a preferred embodiment, 

15 the vector is designed such that, upon homologous recombination, the endogenous Pro4 12-related 
gene is functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a 
"knock out" vector). Alternatively, the vector can be designed such that, upon homologous 
recombination, the endogenous Pro4 12-related gene is mutated or otherwise altered but still 
encodes functional protein (e.g., the upstream regulatory region can be altered to thereby alter the 

20 expression of the endogenous Pro4 1 2-rcl ated protein). In the homologous recombination vector, 
the altered portion of the Pro4 1 2-rclated proteins or gene is flanked at its 5* and 3' ends by 
additional nucleic acid sequence of the Pro4 12-related gene to allow for homologous 
recombination to occur between the exogenous Pro4 12-related gene carried by the vector and an 
endogenous Pro4 12-related gene in an embryonic stem cell. The additional flanking Pro4 12-related 

25 nucleic acid sequence is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5' and 3* ends) are 
included in the vector (see e.g., Thomas, K. R. and Capecchi, M. R. (1987) Cell 51:503, the 
disclosure of which is incorporated herein by reference in its entirety, for a description of 
homologous recombination vectors). The vector is introduced into an embryonic stem cell line 

30 (e.g-, by electroporation) and cells in which the introduced Pro412-relaled gene has homologously 
rccombined with the endogenous Pro412-relaled gene are selected (see e.g., Li, E. et a!. (1992) 
Cell 69:915, the disclosure of which is incorporated herein by reference in its entirety). Hie 
selected cells arc then injected into a blastocyst of an animal (e.g., a mouse) to form aggregation 
chimeras (see e.g., Bradley, A in Teratocarcinomas and Embryonic Stem Cells. A Practical 
35 Approach, E. J. Robertson, cd (IRL, Oxford, 1987) pp. 1 13-152, the disclosure of which is 

incorporated herein by reference in its entirety). A chimeric embryo can then be implanted into a 
suitable pscudopregnant female foster animal and the embryo brought to term. Progeny harboring 
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the homologously recombined DNA in their germ cells can be used to breed animals in which all 
cells of the animal contain the homologously recombined DNA by germline transmission of the 
transgene. Methods for constructing homologous recombination vectors and homologous 
recombinant animals are described further in Bradley, A. (1991) Current Opinion in Biotechnology 

5 2:823-829 and in PCT International Publication Nos.: WO 90/1 1354 by Le Mouellec et ah; WO 
91/01 140 by Smithies ct aL; WO 92/0968 by Zijlstra ct al.; and WO 93/04169 by Bems et al., the 
disclosures of which are incorporated herein by reference in their entireties. 

In another embodiment, transgenic non-human animals can be produced which contain 
selected systems which allow for regulated expression of the transgene. One example of such a 

10 system is the cre/loxP recombinasc system of bacteriophage PI. For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et aL (1992) PNAS 89:6232-6236, the disclosure of which is 
incorporated herein by reference in its entirety. Another example of a recombinasc system is the 
FLP recombinasc system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251:1351- 
1355, the disclosure of which is incorporated herein by reference in its entirety). If a cre/loxP 

1 5 recombinase system is used to regulate expression of the transgene, animals containing transgenes 
encoding both the Cre recombinase and a selected protein are required. Such animals can be 
provided through the construction of "double" transgenic animals, e.g., by mating two transgenic 
animals, one containing a transgene encoding a selected protein and the other containing a 
transgene encoding a recombinase. 

20 

>4w/j-Pro412-rclated antibodies 

The present invention provides antibodies and binding compositions specific for Pro412- 
related proteins. Such antibodies and binding compositions include polyclonal antibodies, 
monoclonal antibodies, Fab and single chain Fv fragments thereof, bispecific antibodies, 

25 heteroconjugates, humanized antibodies, and the like. Such antibodies and binding compositions 
may be produced in a variety of ways, including hybridoma cultures, recombinant expression in 
bacteria or mammalian cell cultures, recombinant expression in transgenic animals, and the like. 
There is abundant guidance in the literature for selecting a particular production methodology, e.g. 
Chadd and Chamow, Curr. Optn. Biotechnot., 12: 188-194 (2001). 

30 The choice of manufacturing methodology depends on several factors including the 

antibody structure desired, the importance of carbohydrate moieties on the antibodies, case of 
culturing and purification, cost, and the like. Many different antibody structures may be generated 
using standard expression technology, including full-length antibodies, antibody fragments, such as 
Fab and Fv fragments, as well as chimeric antibodies comprising components from different 

35 species. Antibody fragments of small size, such as Fab and Fv fragments, having no effector 
functions and limited pharmokinctic activity may be generated in a bacterial expression system. 
Single chain Fv fragments are highly selective for in vivo tumors, show good tumor penetration 
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and low immunogenicity, and are cleared rapidly from the blood, e.g. Freyre et al, J. BiotechnoL, 
76: 157-163 (2000). Thus, such molecules are desirable for radioimmunodectcction and in situ 
radiotherapy. Whenever pharmacokinetic activity in the form of increased half-life is required for 
therapeutic purposes, then a full-length antibodies arc preferable. For example, immunoglobulin G 

5 (IgG) the molecule may be one of four subclasses: y I , y2, y3, or y4. If a full-length antibody with 
effector function is required, then IgG subclasses yl or y3 arc preferred, and TgG subclass y 1 is 
most preferred. The y 1 and y3 subclasses exhibit potent effector function, complement activation, 
and promote antibody-dependent cell-mediated cytotoxicity through interaction with specific Fc 
receptors, e.g. Raju et al, Glycobiology, 10: 477-486 (2000); Lund et al, J. Immunol., 147: 2657- 

10 2662(1991). 

Polyclonal Antibodies. 

The anti-Pro412-related antibodies of the present invention may be polyclonal antibodies. 
Such polyclonal antibodies can be produced in a mammal, for example, following one or more 

15 injections of an immunizing agent, and preferably, an adjuvant. Typically, the immunizing agent 
and/or adjuvant will be injected into the mammal by a series of subcutaneous or intraperitoneal 
injections. The immunizing agent may include Pro412-related proteins or a fusion protein thereof. 
It may be useful to conjugate the antigen to a protein known to be immunogenic in the mammal 
being immunized. Examples of such immunogenic proteins include, but are not limited to, keyhole 

20 limpet hemocyanin (KLH), serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. 
Adjuvants include, for example, Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicoiyno-mycolate). The immunization protocol 
may be determined by one skilled in the art based on standard protocols or by routine 
experimentation. 

25 

Monoclonal Antibodies. 

Alternatively, the anti-Pro412-related antibodies may be monoclonal antibodies. 
Monoclonal antibodies may be produced by hybridomas, wherein a mouse, hamster, or other 
appropriate host animal, is immunized with an immunizing agent to elicit lymphocytes that 

30 produce or are capable of producing antibodies that will specifically bind to the immunizing agent, 
c g. Kohier and Milstetn, Nature 256:495 (1975). Alternatively, the lymphocytes may be 
immunized in vitro. The immunizing agent will typically include the Pro412-related proteins or a 
fusion protein thereof. Generally, spleen cells or lymph node cells arc used if non-human 
mammalian sources arc desired, or peripheral blood lymphocytes CPBLs") are used if cells of 

35 human origin. The lymphocytes are fused with an immortalized cell line using a suitable fusing 
agent, such as polyethylene glycol, to produce a hybridoma cell, c g Goding, MONOCLONAL 
ANTIBODIES: PRINCIPLES AND PRACTICE. Academic Press, pp 59-103 (1986); Liddell and 
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Oyer, A Practical Guide to Monoclonal Antibodies (John Wiley & Sons, New York, 199i); Malik 
and Lillenoj, Editors, Antibody Techniques (Academic Press, New York, 1994). In general, 
immortalized cell lines are transformed mammalian cells, for example, myeloma cells of rat, 
mouse, bovine or human origin. The hybridoma cells are cultured in a suitable culture medium that 

5 preferably contains one or more substances that inhibit the growth or survival of unfused, 
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT), the culture medium for the hybridomas typically will 
include hypoxanthine, aminopterin, and thymidine (HAT), substances which prevent the growth of 
HGPRT-deficient cells. Preferred immortalized cell lines are those that fuse efficiently, support 

10 stable high level production of antibody, and are sensitive to a medium such as HAT medium. 
More preferred immortalized cell lines are murine or human myeloma lines, which can be 
obtained, for example, from the American Type Culture Collection (ATCC), Rockvillc, MD. 
Human myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies, e.g. Kozbor, J. Immunol. 133:3001 (1984); Brodeur 

15 et al. f Monoclonal Antibody Production Techniques and Applications. Marcel Dekker, Inc., New 
York, pp. 51-63 (1987). 

The culture medium (supernatant) in which the hybridoma cells are cultured can be 
assayed for the presence of monoclonal antibodies directed against a Pro412-related proteins. 
Preferably, the binding specificity of monoclonal antibodies present in the hybridoma supernatant 

20 is determined by immunoprecipitation or by an in vitro binding assay, such as radio- immunoassay 
(RIA) or enzyme-linked immunoabsorbent assay (ELISA). Appropriate techniques and assays arc 
known in the art The binding affinity of the monoclonal antibody can, for example, he determined 
by the Scatchard analysis of Munson and Pollard, Anal. Biochetrt 107:220 (1980). After the 
desired antibody-producing hybridoma cells are identified, the cells may be cloned by limiting 

25 dilution procedures and grown by standard methods [Goding, 1 986] Suitable culture media for 

this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMM640 medium. 
Alternatively, the hybridoma cells may be grown in vivo as ascites in a mammal. The monoclonal 
antibodies secreted by selected clones may be isolated or purified from the culture medium or 
ascites fluid by immunoglobulin purification procedures routinely used by those of skill in the art 

30 such as, for example, protein A-Scpharosc, hydroxyl-apatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as 
those described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be isolated from the Pro4l2-rclatcd proteins-specific hybridoma cells and sequenced, 

35 e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding the 
heavy and light chains of murine antibodies. Once isolated, the DNA may be inserted into an 
expression vector, which is then transfected into host cells such as simian COS cells, Chinese 
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hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin 
protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA 
also may be modified, for example, by substituting the coding sequence for the human heavy and 
light chain constant domains for the homologous murine sequences [Morrison et aJ., Proc. Nat 

5 Acad. ScL 81:6851-6855 (1984); Neuberger et at., Nature 312:604-608 (1984); Takeda etal., 

Nature 3 14:452-454 (1985)], or by covalently joining to the immunoglobulin coding sequence all 
or part of the coding sequence for a non-immunoglobulin polypeptide. The non-imrnunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention, or can be 
substituted for the variable domains of one antigen-combining site of an antibody of the invention 

10 to create a chimeric bivalent antibody. The antibodies may also be monovalent antibodies. 

Methods for preparing monovalent antibodies are well known in the art. For example, in vitro 
methods are suitable for preparing monovalent antibodies. Digestion of antibodies to produce 
fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques 
known in the art. 

15 Antibodies and antibody fragments characteristic of hybridomas of the invention can also 

be produced by recombinant means by extracting messenger RNA, constructing a cDNA library, 
and selecting clones which encode segments of the antibody molecule. The following are 
exemplary references disclosing recombinant techniques for producing antibodies: Wall et al., 
Nucleic Acids Research, Vol. 5, pgs. 31 13-3128 (1978); Zakut ct al., Nucleic Acids Research, Vol. 

20 8, pgs. 3591-3601 (1980); Cabilly et aL, Proc. Natl. Acad. Sci., VoL 81, pgs. 3273-3277 (1984); 
Boss et al., Nucleic Acids Research, Vol. 12, pgs. 3791-3806 (1984); Amster et al., Nucleic Acids 
Research, Vol. 8, pgs. 2055-2065 (1980); Moore et ah, U.S. Patent 4,642,334; Skerra et al. 
Science, Vol. 240, pgs. 1038-1041(1988); Huse ct al, Science, Vol. 246, pgs. 1275-1281 (1989); 
and U.S. patents 6,054,297; 5,530,101; 4,816,567; 5,750,105; and 5,648,237; which patents are 

25 incorporated by reference. In particular, such techniques can be used to produce interspecific 
monoclonal antibodies, wherein the binding region of one species is combined with non-binding 
region of the antibody of another species to reduce immunogenicity, e.g. Liu et al., Proc. Natl. 
Acad. Sci., Vol. 84, pgs. 3439-3443 (1987), and patents 6,054,297 and 5,530,101. Preferably, 
recombinantly produced Fab and Fv fragments are expressed in bacterial host systems. Preferably, 

30 full-length antibodies are produced by mammalian cell culture techniques. More preferably, full- 
length antibodies are expressed in Chinese Hampster Ovary (CHO) cells or NSO cells. 

Both polyclonal and monoclonal antibodies can be screened by ELISA. As in other solid 
phase immunoassays, the test is based on the tendency of macromolecules to adsorb 
nonspccifieally to plastic. The irreversibility of this reaction, without loss of immunological 

35 activity, allows the formation of antigen-antibody complexes with a simple separation of such 
complexes from unbound material. To titrate antipeptide serum, peptide conjugated to a carrier 
different from that used in immunization is adsorbed to the wells of a 96-well microliter plate. The 
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adsorbed antigen is then allowed to react in the wells with dilutions of anh-peptide serum. 
Unbound antibody is washed away, and the remaining antigen-antibody complexes are allowed to 
react with antibody specific for the IgG of the immunized animal, this second antibody is 
conjugated to an enzyme such as alkaline phosphatase. A visible colored reaction product 
5 produced when the enzyme substrate is added indicates which wells have bound antipeptidc 

antibodies. The use of spectrophotometer readings allows better quantification of the amount of 
peplide-speciftc antibody bound. High-titer antisera yield a linear titration curve between 10~ 3 and 
10~ 5 dilutions. 

1 0 Pro4 /2-related antibodies. 

The invention includes peptides derived from Pro41 2-related proteins, and immunogens 
comprising conjugates between carriers and peptides of the invention. The term immunogen as 
used herein refers to a substance which is capable of causing an immune response. The term 
carrier as used herein refers to any substance which when chemically conjugated to a peptide of 

15 the invention permits a host organism immunized with the resulting conjugate to generate 

antibodies specific for the conjugated peptide. Carriers include red blood cells, bacteriophages, 
proteins, or synthetic particles such as agarose beads. Preferably, carriers are proteins, such as 
scrum albumin, gamma-globulin, keyhole limpet hemocyanin, thyroglobulin, ovalbumin, 
fibrinogen, or the like. 

20 The general technique of linking synthetic peptides to a carrier is described in several 

references, e.g. Walter and Doolittle, " Antibodies Against Synthetic Peptides," in Setlow et al M 
eds., Genetic Engineering, Vol. 5, pgs. 61-91 (Plenum Press, N.Y., 1983); Green et ai. Cell, Vol. 
28, pgs. 477-487 (1982); Lerner et al, Proc. Natl. Acad. Sci., Vol. 78, pgs. 3403-3407 (1981); 
Shimizu et ah, U.S. Patent 4,474,754; and Ganficld et aL, U.S. Patent 4,31 1,639. Accordingly, 

25 these references arc incorporated by reference Also, techniques employed to link haptens to 
earners are essentially the same as the above-referenced techniques, e.g chapter 20 in Tijsseu 
Practice and Theory of Enzyme Immunoassays (Elsevier, New York, 1985). The four most 
commonly used schemes for attaching a peptide to a carrier arc (1) glutaraldehyde for amino 
coupling, e.g. as disclosed by Kagan and Glick, in Jafle and Behrman, eds. Methods of Hormone 

30 Radioimmunoassay, pgs. 328-329 (Academic Press, N.Y., 1979), and Walter et al. Proc. Natl. 

Acad. Sci., Vol. 77, pgs. 5197-5200 (1980); (2) water-soluble carbodiimides for carboxy! to amino 
coupling, e.g. as disclosed by Hoare et al , J. Biol. Chem., Vol. 242, pgs. 2447-2453 (1967); (3) 
bis-diazobenzidine (DBD) for tyrosine to tyrosine sidechain coupling, e.g. as disclosed by Bassiri 
et al , pgs. 46-47, in Jaffc and Behrman, eds. (cited above), and Walter et al. (cited above); and (4) 

35 malcimjdobcnzoyl-N-hydroxysuccinimidc ester (MBS) for coupling cysteine (or other sulihydryls) 
to ammo groups, e.g. as disclosed by Kitagawa ct al., J. Biochern. (Tokyo), Vol. 79, pgs. 233-239 
(1976), and Lcmcr et al. (cited above). A general rule for selecting an appropriate method for 
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coupling a given peptide to a protein carrier can be stated as follows; the group involved in 
attachment should occur only once in the sequence, preferably at the appropriate end of the 
segment For example, BDB should not be used if a tyrosine residue occurs in the main part of a 
sequence chosen for its potentially antigenic character. Similarly, centrally located lysines rule out 

5 the glutaraldchyde method, and the occurrences of aspartic and glutamic acids frequently exclude 
the carbodiimide approach. On the other hand, suitable residues can be positioned at either end of 
chosen sequence segment as attachment sites, whether or not they occur in the "native" protein 
sequence. Internal segments, unlike the amino and carboxy termini, will differ significantly at the 
"unattached end" from the same sequence as it is found in the native protein where the polypeptide 

10 backbone is continuous. The problem can be remedied, to a degree, by acetylating the a-amino 
group and then attaching the peptide by way of its carboxy terminus. The coupling efficiency to 
the carrier protein is conveniently measured by using a radiouc Lively labeled peptide, prepared 
either by using a radioactive amino acid for one step of the synthesis or by labeling the completed 
peptide by the iodination of a tyrosine residue. The presence of tyrosine in the peptide also allows 

15 one to set up a sensitive radioimmune assay, if desirable. Therefore, tyrosine can be introduced as 
a terminal residue if it is not part of the peptide sequence defined by the native polypeptide. 

Preferred carriers are proteins, and preferred protein carriers include bovine serum 
albumin, myoglobutin, ovalbumin (OVA), keyhole limpet hemocyanin (KLH), or the like. 
Peptides can be linked to KLH through cysteines by MBS as disclosed by Liu et al., Biochemistry, 

20 Vol. 1 8, pgs. 690-697 (1979). The peptides arc dissolved in phosphate-buffered saline (pH 7.5), 
0.1 M sodium borate buffer (pH 9.0) or 1.0 M sodium acetate buffer (pH 4.0). The pH for the 
dissolution of the peptide is chosen to optimize peptide solubility. The content of free cysteine for 
soluble peptides is determined by Ellman's method, Ellman, Arch. Biochem. Biophys., Vol. 82, pg. 
7077 (1959). For each peptide, 4 mg KLH in 0.25 ml of 10 mM sodium phosphate buffer (pH 7.2) 

25 is reacted with 0.7 mg MBS (dissolved in dimethyl formamide) and stirred for 30 min at room 
temperature. The MBS is added dropwise to ensure that the local concentration of formamide is 
not too high, as KLH is insoluble in >30% formamide. The reaction product, KLH-MBS, is then 
passed through Sephadex G-25 equilibrated with 50 mM sodium phosphate buffer (pH 6.0) to 
remove free MBS, KLH recovery from peak fractions of the column cluate (monitored by OD280) 

30 is estimated to be approximately 80%, KLH-MBS is then reacted with 5 mg peptide dissolved 25 
in 1 ml of the chosen buffer. The pH is adjusted to 7-7 5 and the reaction is stirred for 3 hr at 
room temperature. Coupling efficiency is monitored with radioactive peptide by dialysis of a 
sample of the conjugate against phosphate-buffered saline, and ranged from 8% to 60%. Once the 
peptidc-carrier conjugate is available polyclonal or monoclonal antibodies are produced by 

35 standard techniques, e.g. as disclosed by Campbell, Monoclonal Antibody Technology (Elsevier, 
New York, 1984); Hurrell, ed. Monoclonal Hybridoma Antibodies: Techniques and Applications 
(CRC Press, Boca Raton, FL, J 982); Schrcicr ct al. Hybridoma Techniques (Cold Spring Harbor 
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Laboratory, New York, 1980); VS. Patent 4,562,003; or the like. In particular, US. Patent 
4,562,003 is incorporated by reference. 

Humanized Antibodies. 

5 The anti-Pro412-rclated antibodies of the invention may further comprise humanized 

antibodies or human antibodies. The term "humanized antibody" refers to humanized forms of 
non-human (e.g.> murine) antibodies that are chimeric antibodies, immunoglobulin chains or 
fragments thereof (such as Fv, Fab, Fab', F(ab*), or other antigen-binding partial sequences of 
antibodies) which contain some portion of the sequence derived from non-human antibody. 

10 Humanized antibodies include human immunoglobulins in which residues from a complementary 
determining region (CDR) of the human immunoglobulin are replaced by residues from a CDR of 
a non-human species such as mouse, rat or rabbit having the desired binding specificity, affinity 
and capacity. In general, the humanized antibody will comprise substantially all of at least one, and 
generally two, variable domains, in which alt or substantially all of the CDR regions correspond to 

15 those of a non-human immunoglobulin and all or substantially all of the FR regions arc those of a 
human immunoglobulin consensus sequence. The humanized antibody optimally also will 
comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin [Jones et a!., Nature 321:522-525 (1986) and Presta, Cuw. Op. StvucL Biol 
2:593-S96 (1992)]. Methods for humanizing non-human antibodies are well known in the art. 

20 Generally, a humanized antibody has one or more ammo acids introduced into it from a source 
which is non-human in order to more closely resemble a human antibody, while still retaining the 
original binding activity of the antibody. Methods for humanization of antibodies are further 
detailed in Jones et ah, Nature 321:522-525 (1986); Riechmann et ah, Nature 332:323-327 (1988); 
and Vcrhocycn et at., Science 239:1534-1536 (1988). Such "humanized" antibodies are chimeric 

25 antibodies in that substantially less than an intact human variable domain has been substituted by 
the corresponding sequence from a non-human species. 

Heteroconjugate Antibodies. 

Heteroconjugate antibodies which comprise two covalently joined antibodies, arc also 
30 within the scope of the present invention. Heteroconjugate antibodies may be prepared in vitro 

using known methods in synthetic protein chemistry, including those involving crosslinking agents. 
For example, immunotoxins may be prepared using a disulfide exchange reaction or by forming a 
thiocthcr bond. 

35 Bispecijic Antibodies. 

Bispccific antibodies have binding specificities for at least two different antigens. Such 
antibodies arc monoclonal, and preferably human or humanized. One of the binding specificities of 
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a bispecific antibody of the present invention is for a Pro412-rclated proteins, and the other one is 
preferably for a cell-surface protein or receptor or receptor subunit. Methods for making bispecific 
antibodies are known in the art, and in general, the recombinant production of bispecific antibodies 
is based on the co-expression of two immunoglobulin hcavy-chain/light-chain pairs in hybridoma 
5 cells, where the two heavy chains have different specificities, e.g. Milstein and Cuello, Nature 
305:537-539 (1983). Given that the random assortment of immunoglobulin heavy and light chains 
results in production of potentially ten different antibody molecules by the hybridomas, 
purification of the correct molecule usually requires some sort of affinity purification, e.g. affinity 
chromatography. 

10 

Antibody antagonists. 

Preferably, antagonists of the invention are derived from antibodies specific for Pro412- 
related proteins. More preferably, the antagonists of the invention comprise fragments or binding 
compositions specific for Pro412-related proteins. Antibodies comprise an assembly of 

15 polypeptide chains linked together by disulfide bridges. Two major polypeptide chains, referred to 
as the light chain and the heavy chain, make up all major structural classes (isotypes) of antibody. 
Both heavy chains and light chains are further divided into subregions referred to as variable 
regions and constant regions. Heavy chains comprise a single variable region and three different 
constant regions, and light chains comprise a single variable region (different from that of the 

20 heavy chain) and a single constant region (different from those of the heavy chain). The variable 
regions of the heavy chain and light chain are responsible for the antibody's binding specificity. 
As used herein, the term "heavy chain variable region" means a polypeptide (I) which is from 1 10 
to 125 amino acids in length, and (2) whose amino acid sequence corresponds to that of a heavy 
chain of a monoclonal antibody of the invention, starting from the heavy chain's N-terminal amino 

25 acid. Likewise, the term "light chain variable region 11 means a polypeptide (1) which is from 95 to 
115 amino acids in length, and (2) whose amino acid sequence corresponds to that of a light chain 
of a monoclonal antibody of the invention, starting from the light chain's N-terminal amino acid. 
As used herein the term "monoclonal antibody 11 refers to homogeneous populations of 
immunoglobulins which arc capable of specifically binding to Pro4l2-related proteins. As used 

30 herein the term "binding composition" means a composition comprising two polypeptide chains (I) 
which, when operationally associated, assume a conformation having high binding affinity for 
Pro412-related proteins, and (2) which are derived from a hybridoma producing monoclonal 
antibodies specific for Pro412-related proteins. The term "operationally associated" is meant to 
indicate that the two polypeptide chains can be positioned relative to one another for binding by a 

35 variety of means, including by association in a native antibody fragment, such as Fab or Fv, or by 
way of genetically engineered cystcinc-containing peptide linkers at the carboxyl termini. 
Normally, the two polypeptide chains correspond to the light chain variable region and heavy chain 
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variable region of a monoclonal antibody specific for Pro412-related proteins. Preferably, 
antagonists of the invention are derived from monoclonal antibodies specific for Pro412-related 
proteins. Monoclonal antibodies capable of blocking, or neutralizing, Pre412-related proteins are 
selected by their ability to inhibit Pro4l2-related proteins-induced effects. 

The use and generation of fragments of antibodies is also well known, e.g. Fab fragments: 
Tijssen, Practice and Theory of Enzyme Immunoassays (Elsevier, Amsterdam, 1985); and Fv 
fragments: Hochman et al. Biochemistry, VoL 12, pgs. 1 130-1 135 (1973), Sharon et ah. 
Biochemistry, Vol. 15, pgs 159M594 (1976) and Ehrlich et al. r U.S. Patent 4,355,023; and 
antibody half molecules: Auditore- Hargrcavcs, VS. Patent 4,470,925. 

Preferably, monoclonal antibodies, Fv fragments, Fab fragments, or other binding 
compositions derived from monoclonal antibodies of the invention have a high affinity to Pro412- 
relatcd proteins. The affinity of monoclonal antibodies and related molecules to Pro412-rclated 
proteins may be measured by conventional techniques including plasmon resonance, ELISA, 
equilibrium dialysis, and the like. Affinity measurement by plasmon resonance techniques may be 
carried out, for example, using a BIAcore 2000 instrument (Biacorc AB, Uppsala, Sweden) in 
accordance with the manufacturer's recommended protocol. Preferably, is measured by ELISA., 
for example, as described in U.S. patent 6,235,883, or like reference. Preferably, the dissociation 
constant between Pro412-related proteins and monoclonal antibodies of the invention is less than 
10" 5 molar. More preferably, such dissociation constant is less than 10" 8 molar; still more 
preferably, such dissociation constant is less than iO" 9 molar; and most preferably, such 
dissociation constant is in the range of 10~ 9 to 10" molar. 

The invention includes kits for performing cither quantitative immunoassays or PCRs to 
detect the abundances of Pro4l2-related proteins or its mRNA transcripts, respectively. 
Preferably, monoclonal antibody kits of the invention includes at least one monoclonal antibody 
specific for Pro412-rclatcd proteins. Kits of the invention further include labeling means to detect 
the quantity of each monoclonal antibody bound to its target protein. Preferably, quantitative 
measurements are made using a conventional ELISA format, as described more fully below. 

The antibodies of the present invention find use in diagnostic assays for the determination 
of Pro412-rclated proteins. The antibodies of the invention may be used in most assays involving 
antigen-antibody reactions. The assays may be homogeneous or heterogeneous. In a homogeneous 
assay approach, the sample can be a biological sample or tluid such as serum, urine, whole blood, 
lymphatic fluid, plasma, saliva, and the like, cells, tissue, and material secreted by cells or tissues 
cultured in vitro. The sample can be pretreated if necessary to remove unwanted materials. The 
immunological reaction usually involves the specific antibody, labeled analytc, and the sample 
suspected of containing the analyte. The analyte can be directly labeled with the label or indirectly 
labeled with a means for incorporating the label such as conjugation of the analyte to biotin and 
having labeled avidin or anli-biotin. The signal from the label is modified, directly or indirectly, 
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upon the binding of the antibody of the labeled analyte. Both the immunological reaction and 
detection of the extent thereof are carried out in a homogeneous solution. Labels which may be 
employed as part of a signal producing system capable of producing a signal in relation to the 
amount of analyte in the sample include free radicals, chromogens, such as fluorescent dyes, 

5 chemiluminescers, enzymes, bacteriophages, coenzymes particulate labels and so forth. 

In a heterogeneous assay approach, the reagents are usually the sample, the specific 
antibody, and means for producing a detectable signal. The specimen is generally placed on a 
support, such as a plate or a slide, and contacted with the antibody in a liquid phase. The support is 
then separated from the liquid phase and either the support phase or the liquid phase is examined 

10 for a detectable signal employing meuns for producing such signal or signal producing system. The 
signal is related to the presence of the analyte in the sample. Means for producing a detectable 
signal includes the use of radioactive labels, fluorescein, enzymes, and so forth. Bxempary of 
heterogeneous immunoassays are the radioimmunoassay, immunofluorescence methods, enzyme- 
linked immunoassays, and the like. 

1 5 One embodiment of an assay employing an antibody of the present invention involves the 

use of a surface to which the monoclonal antibody of the invention is attached. The underlying 
structure of the surface may take different forms, have different compositions and may be a 
mixture of compositions or laminates or comthereof The surface may assume a variety of shapes 
and -forms and may have varied dimensions, depending on the manner of use and measurement. 

20 Illustrative surfaces may be pads, beads, discs, or strips which may be flat, concave or convex. 
Thickness is not critical, generally being from about 0.1 to 2 mm thick and of any convenient 
diameter or other dimensions, surface typically will be supported on a rod, tube, capillary, Fiber, 
strip, disc, plate, cuvette and the like. The surface will typically be porous and poiyfunctiona! or 
capable of being polyfunctionalized so as to permit covalent binding of the monoclonal antibody of 

25 the invention as well as to permit bonding of other compounds which form a part of a means for 
producing a detectable signal. A wide variety of organic and inorganic polymers, both natural and 
synthetic, and combinations thereof, may be employed as the material for the solid surface. 
Illustrative polymers include polyethylene, polypropylene, poly(4-methylbutcne), polystyrene, 
polymcthracrylate, poly(ethy!cne terephthalate), rayon, nylon, poly(vmyl butyrate), silicones, 

30 polyformaldehyde, cellulose, cellulose acetate, nitrocellulose, latex, etc. Other which may be 

employed include paper, glasses, ceramics, metals, metaloids, semiconductor materials, cermets, 
silicates or the like. Also included substrates that form gels, gelatins, lipopolysaccharides, silicates, 
agarose and polyacrylamides or polymers which form several aqeuous phases such as dcxtrans, 
polyalkylene glycols (alkylene of 2 to 3 carbon atoms) or surfactants such as phospholipids. The 

35 binding of the monoclonal antibody of the invention to the surface may be accomplished by well 
known, techniques, commonly available in the literature. Sec, for example, "Immobilized 
Enzymes/' Ichiro Chibata, Press, New York (1978) and Cuatrccasas, J Bio. Chem., 245: 3059 
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(1970). In carrying out the assay in accordance with this of the invention the sample is mixed with 
aqueous medium and the medium is contacted with the surface having a monoclonal antibody of 
the invention bound thereto. Members of a signal producing system and any ancillary materials 
may also be included in the aqueous medium, cither concurrently or added subsequently so as to 
provide a detectable signal associated with the surface. The means for producing the detectable 
signal can involve the incorporation in the of a labeled analytc or it may involve the use of a 
second monoclonal antibody having a label conjugated thereto. Separation and washing steps will 
be carried out as needed. The signal detected is related to the presence of Pro412-rclatcd proteins 
in the sample. It is within the of the present invention to include a calibration as the measurement 
surface on the same support A particular embodiment of an assay in accordance with the present 
invention, by way of illustration and not limitation, involves the use of a support such as a slide or 
a well of a petri dish. The technique involves the sample to be analyzed on the support with an 
appropriate fixing material such as acetone and incubating the sample on the slide with a 
monoclonal antibody of the invention. After washing with an appropriate buffer such as, for 
example, phosphate buffered saline, the support is contacted with a labeled specific binding 
partner for the analyte in the sample. After incubation as desired, the slide is washed a second time 
with an aqueous buffer and the determination is made of the binding of the labeled monoclonal 
antibody to the analytc. If the label is fluorescent, the slide may be covered with a fluorescent 
antibody mounting fluid on a cover slip and men examined with a fluorescent microscope to 
determine the extent of binding. On the other hand, the label can be an enzyme conjugated to the 
monoclonal antibody of the invention and the the extent of binding can be determined by 
examining the slide for the presence of enzyme activity, which may be indicated by the formation 
of a precipitate, a color, or the like. A particular example of an assay utilizing the present 
antibodies is a double determinant EL1SA assay. A support such as, e.g., a glass or vinyl plate, is 
coated with antibody specific for Pro412-related proteins by conventional techniques. The support 
is contacted with the sample suspected of containing Pro4l2-rcIatcd proteins, usually in a aqueous 
medium. After an incubation period from 30 seconds to 12 hours, the support is separated from the 
medium, washed to remove unbound Pro4l2-relatcd proteins with, for example, water or an 
aqueous buffered medium, and contacted with an antibody specific for Pro412-relatcd proteins, 
again usually in an aqueous medium. The antibody is labeled with an enzyme directly or indirectly 
such as, e.g., horseradish peroxidase or alkaline phosphatase After incubation, the support is 
separated from the medium, and washed as above. The enzyme activity of the support or the 
aqueous medium is determined. This enzyme activity is related to the amount of Pro412-related 
proteins in the sample. 

The invention also includes kits, e.g., diagnostic assay kits, for carrying out the methods 
disclosed above In one embodiment, the kit comprises in packaged combination (a) a monoclonal 
antibody more specifically defined above and (b) a conjugate of a specific binding partner for the 
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above monoclonal antibody and a label capable of producing a detectable signal. The reagents may 
also include ancillary agents such as buffering agents and protein stabilizing agents, e.g., 
polysaccharides and the like. The kit may further include, where necessary, other members of the 
signal producing system of which system the label is a member, agents for reducing background 
5 interference in a test, control reagents, apparatus for conducting a test, and the like. In another 

embodiment, the diagnostic kit comprises a conjugate of monoclonal antibody of the invention and 
a label capable of producing a detectable signal. Ancillary agents as mentioned above may also be 
present. 

10 An anti-Pro41 2-related antibody (e.g., monoclonal antibody) can be used to isolate Pro412- 

relatcd protein by standard techniques, such as affinity chromatography or imrnunoprecipitation. 
For example, an anti-Pro412-related antibody can facilitate the purification of natural Pro412- 
related proteins from cells and of recombinantly produced Pro4 1 2-related proteins expressed in 
host cells. Moreover, an anti-Pro412-rclatcd antibody can be used to detect Pro41 2-related protein 

15 (eg-, in a cellular lysate or cell supernatant) in order to evaluate the abundance and pattern of 
expression of the Pro4l2-reIated protein. An ti-Pro4! 2-related antibodies can be used 
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for 
example, determine the efficacy of a given treatment regimen. Detection can be facilitated by 
coupling (i.e., physically linking) the antibody to a detectable substance. Examples of detectable 

20 substances include various enzymes, prosthetic groups, fluorescent materials, luminescent 
materials, bioluminescent materials, and radioactive materials. Examples of suitable enzymes 
include horseradish peroxidase, alkaline phosphatase, -galactosidase, or acetylcholinesterase; 
examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; 
examples of suitable fluorescent materials include umbcllifcronc, fluorescein, fluorescein 

25 isothiocyanate, rhodamine, m'chlorotriazmylamine fluorescein, dansyl chloride or phycoerythrin; 
an example of a luminescent material includes luminol; examples of bioluminescent materials 
include luciferasc, lucifcrin, and aequorin, and examples of suitable radioactive material include 
125 If l3l lf 35 S or3 H . 

30 Drug screening assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying inhibitors, i.e., candidate or test compounds or agents (e.g., small molecules and 
peptides, antibodies, peptidomimetics or other drugs) which bind to Pro41 2-related proteins, have 
an inhibitory or activating effect on, for example, Pro4 1 2-related proteins expression or preferably 

35 Pro4 1 2-related activity, or have an inhibitory or activating effect on, for example, the activity of a 
Pro41 2-related target molecule. Tn some embodiments small molecules can be generated using 
combinatorial chemistry or can be obtained from a natural products library. Assays may be cell 
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based or non-cell based assays. Drug screening assays may be binding assays or more 
preferentially functional assays, as further described. 

In other embodiments, an assay is a cell-based assay in which a cell which expresses a 
Pro412-related protein or biologically active portion thereof is contacted with a test compound and 

5 the ability of the test compound to inhibit, activate, or increase Pro4 1 2- related activity determined- 
Determining the ability of the test compound to inhibit, activate, or increase Pro4l2-related 
activity can be accomplished by monitoring the bioactivity of the Pro412-reIated protein or 
biologically active portion thereof. The cell, for example, can be of mammalian origin, insect 
origin, bacterial origin or a yeast cell. For example, in some embodiments, the cell can be a 

10 mammalian cell, bacterial cell or yeast cell which has been engineered to lack a natural inhibitor of 
a Pro412-rc)atcd proteins. 

The invention further encompasses compounds capable of inhibiting or activating Pro412- 
related activity. Preferably, a Pro4 I2-relatcd inhibitor or activator is a selective Pro4 1 2-related 
15 inhibitor or activator. Assays of the invention may be used to screen any suitable collection of 
compounds. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are target molecules of a Pro412-related protein or polypeptide or biologically 

20 active portion thereof. In another embodiment, the invention provides assays for screening 

candidate or test compounds which bind to or modulate the activity of a Pro412-related protein or 
polypeptide or biologically active portion thereof. The test compounds of the present invention can 
be obtained using any of the numerous approaches in combinatorial library methods known in the 
art, including: biological libraries; spatially addressable parallel solid phase or solution phase 

25 libraries; synthetic library methods requiring deconvolution; the "one-bead one-compound' library 
method; and synthetic library methods using affinity chromatography selection. The biological 
library approach is used with peptide libraries, while the olher four approaches are applicable to 
peptide, non-peptide oligomer or small molecule libraries of compounds (Lam* K, S. (1997) 
Anticancer Drug Des. 12:145, the disclosure of which is incorporated herein by reference in its 

30 entirety). 

Examples of methods for the synthesis of molecular libraries can be found in die art, for 
example in: DeWitt et aL (1993) Proc. Natl. Acad. Sci. US A. 90 6909; Erb et aL (1994) Proc. 
Natl. Acad. Sci. USA 91:11 422; Zuckermann ct al. (1 994). J. Med. Chcm. 37:2678; Cho et ah 
(1993) Science 261:1303; Carrclletal. (1994) Angew Chem lnt Ed. Engl. 33:2059; Carell et ai. 
35 (1994) Angew. Chem. Inl Ed Engl. 33:2061; and in Gallop et al (1994) J. Med. Chem. 37:1233, 
the disclosures of which arc incorporated herein by reference in their entireties. 

Libraries of compounds may be presented in solution (e g. f Houghten (1992) 
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor (1993) 
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Nature 364:555-556), bacteria (Ladner U.S. Pat. No. 5.223.409), spores (Ladner U.S. Pat No. 
•409). plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 89: 1865-1869) or on phage (Scott and 
Smith (1990) Science 249:386-390); (Devin (1990) Science 249:404-406); (Cwirla et al. (1990) 
Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol. 222:301-310); (Ladner supra.), 
the disclosures of which are incorporated herein by reference in their entireties. 

Determining the ability of the test compound to inhibit or increase Pro412-related activity 
can also be accomplished, for example, by coupling the Pro412-rclatcd protein or biologically 
active portion thereof with a radioisotope or en^matic label such that binding of the Pro412- 
related protein or biologically active portion thereof to its cognate target molecule can be 
determined by detecting the labeled Pro412-related protein or biologically active portion thereof in 
a complex. For example, compounds (e.g., Pro412-related protein or biologically active portion 
thereof) can be labeled with l25 1 , 35 S, 14 C. or 3 H, either directly or indirectly, and the 
radioisotope detected by direct counting of radioemmission or by scintillation counting 
Alternatively, compounds can be enzymaticalry labeled with, for example, horseradish peroxidase, 
alkaline phosphatase, or luciferase, and the enzymatic label detected by determination of 
conversion of an appropriate substrate to product. The labeled molecule is placed in contact with 
its cognate molecule and the extent of complex formation is measured. For example, the extent of 
complex formation may be measured by immuno precipitating the complex or by performing gel 
electrophoresis. 

• It is also within the scope of this invention to determine the ability of a compound (e.g., 
Pro412-rclatcd protein or biologically active portion thereof) to interact with its cognate target 
molecule without the labeling of any of the intcractants. For example, a microphysiometer can be 
used to detect the interaction of a compound with its cognate target molecule without the labeling 
of either the compound or the target molecule. McConnell, H. M. ct al. (1 992) Science 257: 1 906- 
1912, the disclosure of which is incorporated herein by reference in its entirety. A 
microphysiometer such as a cytosensor is an analytical instrument that measures the rate at which a 
cell acidifies its environment using a light-addressable polentiometric sensor (LAPS). Changes in 
this acidification rate can be used as an indicator of the interaction between compound and 
receptor. 

In a preferred embodiment, the assay comprises contacting a cell which expresses a 
Pro412-related protein or biologically active portion thereof, with a target molecule to form an 
assay mixture, contacting the assay mixture with a test compound, and determining the ability of 
the test compound to inhibit or increase the activity of the Pro412-related protein or biologically 
active portion thereof, wherein determining the ability of the test compound to inhibit or increase 
the activity of the Pro4 12-retated protein or biologically active portion thereof, comprises 
determining the ability of the test compound to inhibit or increase a biological activity of the 
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Pro41 2-related proteins expressing cell (e.g., determining the ability of the test compound to 
inhibit or increase transduction, protein rprotein interactions, proteolysis (as discussed above). 

In another preferred embodiment, the assay comprises contacting a cell which is 
responsive to a Pro412-relatcd protein or biologically active portion thereof, with a Pro41 2-related 
protein or biologically-active portion thereof, to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to modulate the 
activity of the Pro41 2-related protein or biologically active portion thereof, wherein determining 
the ability of the test compound to modulate the activity of the Pro412-rciatcd protein or 
biologically active portion thereof comprises determining the ability of the test compound to 
modulate a biological activity of the Pro4l2-rclatcd proteins-responsive cell (eg,, determining the 
ability of the test compound to modulate an indirect Pro412-related activity such as e.g. signal 
transduction or protehr.proiein interactions, or detennining the ability of the test compound to 
modulate antigen processing, cell proliferation or cell death). 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a Pro41 2-related target molecule (i.e. a molecule with which Pro412-rclated proteins 
interacts, preferably a molecule cleaved by Pro4 1 2-related proteins) with a test compound and 
determining the ability of the test compound to modulate (eg. stimulate or inhibit) the activity of 
the Pro412-related target molecule. Determining the ability of the test compound to modulate the 
activity of a Pro41 2-related target molecule can be accomplished, for example, by determining 
assessing the activity of a target molecule (that is, of a cleavage substrate or of the cleavage 
product), or by assessing the ability of the Pro41 2-related protein to bind to or interact with the 
Pro412-rclatcd target molecule. 

Determining the ability of the Pro41 2-related protein to bind to or interact with a Pro412- 
relatcd target molecule, such as for example a natural Pro4 1 2-related inhibitor, can be 
accomplished by one of the methods described above for determining direct binding. In a preferred 
embodiment, determining the ability of the Pro412-rcIatcd protein to bind to or interact with a 
Pro412-related target molecule can be accomplished by determining the activity of the target 
molecule. For example, the activity of the target molecule can be determined by contacting the 
target molecule with the Pro41 2-related protein or a fragment thereof and measuring induction of a 
cellular second messenger of the target (i.e. intracellular Ca2+, diacylglycerol, BP3, etc.), detecting 
catalytic/cnaymatic activity of the target an appropriate substrate, detecting the induction of a 
reporter gene (comprising a target-responsive regulatory element opcrativcly linked to a nucleic 
acid encoding a detectable marker, e.g., lucifcrasc), or detecting a target-regulated cellular 
response, for example, signal transduction or proteimprotcin interactions 

In yet another embodiment, an assay of the present invennon is a cell-free assay in which a 
Pro41 2-related protein or biologically active portion thereof is contacted with a test compound and 
the ability of the test compound to bind to the Pro41 2-related protein or biologically active portion 
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thereof is determined. Binding of the test compound to the Pro4 12-related protein can be 
determined either directly or indirectly as described above. In a preferred embodiment, the assay 
includes contacting the Pro412-re)ated protein or biologically active portion thereof with a known 
compound which binds Pro41 2 -related proteins (e.g., a Pro4 12-related target molecule) to form an 

5 assay mixture, contacting the assay mixture with a test compound, and determining the ability of 
the test compound to interact with a Pro4 12-related protein, wherein determining the ability of the 
test compound to interact with a Pro4 12-related protein comprises determining the ability of the 
test compound to preferentially bind to Pro412-rc1ated proteins or biologically active portion 
thereof as compared to the known compound. 

10 In another embodiment, the assay is a cell-free assay in which a Pro4 12-related protein or 

biologically active portion thereof is contacted with a test compound and the ability of the test 
compound to modulate (e.g„ stimulate or inhibit) the activity of the Pro412-related protein or 
biologically active portion thereof is determined. Determining the ability of the test compound to 
modulate the activity of a Pro412-related protein can be accomplished, for example, by 

15 determining the ability of the Pro4 12-re!ated protein to bind to a Pro4 U-relatcd target molecule by 
one of the methods described above for determining direct binding. Determining the ability of the 
Pro412-related protein to bind to a Pro4 12-related target molecule can also be accomplished using 
a technology such as real-time Biomolccular Interaction Analysis (B1A). Sjolander, S. and 
Urbaniczky, C. (1991) Anal. Chem. 63:2338-2345 and Szabo et al. (1995) Curr. Opin. Struct Biol. 

20 5:699-705, the disclosures of which are incorporated herein by reference in their entireties. As 
used herein, "BIA" is a technology for studying biospecific interactions in real time, without 
labeling any of the interactants (e.g., BIAcore). Changes in the optical phenomenon of surface 
plasmon resonance (SPR) can be used as an indication of real-time reactions between biological 
molecules. 

25 In an alternative embodiment, determining the ability of the test compound to modulate the 

activity of a Pro4 12-related protein can be accomplished by determining the ability of the Pro412- 
relatcd protein to further modulate the activity of a downstream effector (e.g., a growth factor 
mediated signal transduction pathway component) of a Pro4 12-related target molecule. For 
example, the activity of the effector molecule on an appropriate target can be determined or the 

30 binding of the effector to an appropriate target can be determined as previously described. 

In yet another embodiment, the cell-free assay involves contacting a Pro4 12-related protein 
or biologically active portion thereof with a known compound which binds the Pro4 12-related 
protein to form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with the Pro4 12-related protein, wherein 

35 determining the ability of the test compound to interact with the Pro4 12-related protein comprises 
determining the ability of the Pro4 12-related protein to preferentially bind to or modulate the 
activity of a Pro4 12-related target molecule. 
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The cell-free assays of the present invention arc amenable to use of both soluble and/or 
membrane-bound forms of isolated proteins (e.g. Pro4l2-related proteins or biologically active 
portions thereof or molecules to which Pro4 12-related targets bind). In the case of cell-free assays 
in which a membrane-bound form an isolated protein is used it may be desirable to utilize a 

5 solubilizing agent such that the membrane-bound form of the isolated protein is maintained in 
solution. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl- 
N-methylglucamide, Triton TM X-100, Triton TM X-l 14, Thesit TM, Isotridecypoly(ethylenc 
glycol ether)n,3-[(3-cholamidopropyl)dimethylamminio]- 1 -propane sulfonate (CHAPS), 3-[(3- 

10 cholaniidopropy!)dimeuiylarruTunio]--2-hydroxy-l-propanc sulfonate (CHAPSO), or N- 
dodecyHN,N-dimcthyl-3-ammonio- i-propane sulfonate. 

In more than one embodiment of the above assay methods of the present invention, it may 
be desirable to immobilize cither a Pro412-related protein or its target molecule to facilitate 
separation of complexcd from uncomplexed forms of one or both of the proteins, as well as to 

15 accommodate automation of the assay. Binding of a test compound to a Pro412-reIated protein, or 
interaction of a Pro4 12-rclated protein with a target molecule in the presence and absence of a 
candidate compound, can be accomplished in any vessel suitable for containing the reactants. 
Examples of such vessels include microtitrc plates, test tubes, and micro-centrifuge tubes. In one 
embodiment, a fusion protein can be provided which adds a domain that allows one or both of the 

20 proteins to be bound to a matrix. For example, glutathione-S-transfcrase/Pro4 12-related fusion 
proteins or glutathione-S-transferase/target fusion proteins can be adsorbed onto glutathione 
sepharose beads (Sigma Chemical, St. Louis, Mo.) or glutathione derivatized microtitre plates, 
which are then combined with the test compound or the test compound and either the non-adsorbed 
target protein or Pro4 12-related protein, and the mixture incubated under conditions conducive to 

25 complex formation (e.g., at physiological conditions for salt and pH). Following incubation, the 
beads or microtitre plate wells are washed to remove any unbound components, the matrix 
immobilized in the case of beads, complex determined either directly or indirectly, for example, as 
described above. Alternatively, the complexes can be dissociated from the matrix, and the level of 
Pro412-related proteins binding or activity determined using standard techniques. 

30 Other techniques for immobilizing proteins on matrices can also be used in the screening 

assays of the invention. For example, cither a Pro4 12-rclated protein or a Pro4 12-related target 
molecule can be immobilized utilizing conjugation of biotin and strcptavidin. Biotinylated Pro4 12- 
related protein or target molecules can be prepared from bioiin-NHS (N-hydroxy-succinimidc) 
using techniques well known in the art (e.g., biotinytation kit, Pierce Chemicals, Rockford, I1L), 

35 and immobilized in the wells of streptavidin-coatcd 96 well plates (Pierce Chemical). 

Alternatively, antibodies reactive with Pro4 12-rclated protein or target molecules but which do not 
interfere with binding of the Pro4 12-related protein to its target molecule can be derivatized to the 
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wells of the plate, and unbound target or Pro4 12-related protein trapped in the wells by antibody 
conjugation: Methods for detecting such complexes, in addition to those described above for the 
GST-immobilized complexes, include immunodetection of complexes using antibodies reactive 
with the Pro4 12-related protein or target molecule, as well as enzyme-linked assays which rely on 
5 detecting an enzymatic activity associated with the Pro4 12-related protein or target molecule. 

In another embodiment, modulators of Pro412-rc!atcd proteins expression are identified in 
a method wherein a cell is contacted with a candidate compound and the expression of Pro412- 
related proteins mRNA or protein in the cell is determined. The level of expression of Pro412- 
rclatcd proteins mRNA or protein in the presence of the candidate compound is compared to the 
10 level of expression of Pro4 12-related proteins mRNA or protein in the absence of the candidate 
compound. The candidate compound can then be identified as a modulator of Pro4 12-related 
proteins expression based on this comparison. For example, when expression of Pro412Hrclatcd 
proteins mRNA or protein is greater (statistically significantly greater) in the presence of the 
candidate compound than in its absence, the candidate compound is identified as a stimulator of 
15 Pro412-rclated proteins mRNA or protein expression. Alternatively, when expression of Pro412- 
related proteins mRNA or protein is less (statistically significantly less) in the presence of the 
candidate compound than in its absence, the candidate compound is identified as an inhibitor of 
Pro412-related proteins mRNA or protein expression The level of Pro4 12-related proteins mRNA 
or protein expression in the cells can be determined by methods described herein for delecting 
20 Pro412-related proteins mRNA or protein. 

In yet another aspect of the invention, the Pro4 12-related proteins can be used as "bail 
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Pat. No. 5,283,3 1 7; Zervos et 
al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel et al. 
(1993) Biotechniqucs 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; and Brent 
25 WO94/10300, the disclosures of which arc incorporated herein by reference in their entireties), to 
identify other proteins, which bind to or interact with Pro412-r elated proteins (*Pro412-rclatcd 
proteins-binding proteins" or "Pro4 12-related proteins-bp") and are involved in Pro4 12-related 
activity. Such Pro4 1 2-relatcd proteins-binding proteins arc also likely to be involved in the 
propagation of signals by the Pro4 12-related proteins or Pro4 12-related targets as, for example, 
30 downstream elements of a Pro4 1 2-related proteins-mediated signaling pathway. Alternatively, such 
Pro4 12-related proteins-binding proteins are likely to be Pro4 12-related inhibitors. 

The two-hybrid system is based on the modular nature of most transcription faclors, which 
consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two different 
DNA constructs. In one construct, the gene that codes for a Pro4 12-related protein or a fragment 
35 thereof is fused to a gene encoding the DNA binding domain of a known transcription factor (e.g., 
GAL-4). In the other construct, a DNA sequence, from a library of DNA sequences, that encodes 
an unidentified protein ("prey* ot "sample") is fused to a gene that codes for the activation domain 
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of the known transcription Factor. If the "bait" and the "prey" proteins are able to interact, in vivo, 
forming a Pro412-related proteins-dependent complex, the DNA-binding and activation domains of 
the transcription factor are brought into close proximity. This proximity allows transcription of a 
reporter gene (e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to 

5 the transcription factor. Expression of the reporter gene can be detected and cell colonies 

containing the functional transcription factor can be isolated and used to obtain the cloned gene 
which encodes the protein which interacts with the Pro4 12-related protein. 

This invention further pertains to novel agents identified by the above-described screening 
assays and to processes for producing such agents by use of these assays. Accordingly, in one 

10 embodiment, the present invention includes a compound or agent obtainable by a method 

comprising the steps of any one of the aformentioned screening assays (e.g., cell-based assays or 
cell-free assays). For example, in one embodiment, the invention includes a compound or agent 
obtainable by a method comprising contacting a cell which expresses a Pro4 12-related target 
molecule with a test compound and the determining the ability of the test compound to bind to, or 

15 modulate the activity of, the Pro4 12-related target molecule. In another embodiment, the invention 
includes a compound or agent obtainable by a method comprising contacting a cell which 
expresses a Pro4 12-related target molecule with a Pro4 12-related protein or biologically-active 
portion thereof, to form an assay mixture, conlacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with, or modulate the activity of, the 

20 Pro4 12-related target molecule. In another embodiment, the invention includes a compound or 

agent obtainable by a method comprising contacting a Pro4 12-related protein or biologically active 
portion thereof with a test compound and determining the ability of the test compound to bind to, 
or modulate (e.g., stimulate or inhibit) the activity of, the Pro4 12-related protein or biologically 
active portion thereof. In yet another embodiment, the present invention included a compound or 

25 agent obtainable by a method comprising contacting a Pro4 12-related protein or biologically active 
portion thereof with a known compound which binds the Pro4 12-related protein to form an assay 
mixture, contacting the assay mixture with a test compound, and determining the ability of the test 
compound to interact with, or modulate the activity of the Pro4 12-related protein. 

Accordingly, it is within the scope of this invention to further use an agent identified as 

30 described herein in an appropriate animal model. For example, an agent identified as described 
herein (e.g., a Pro412-rclated proteins modulating agent, an antisense Pro4 12-related nucleic acid 
molecule, a Pro4 12-related proteins-specific antibody, or a Pro4 12-related proteins-binding 
partner) can be used in an animal model to determine the efficacy, toxicity, or side effects of 
treatment with such an agent Alternatively, an agent identified as described herein can be used in 

35 an animal model to determine the mechanism of action of such an agent Furthermore, this 

invention pertains to uses of novel agents identified by the above-described screening assays for 
treatments as described herein. 
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Tlic present inventon also pertains to uses of novel agents identified by the above- 
described screening assays for diagnoses, prognoses, and treatments as described herein. 
Accordingly, it is within the scope of the present invention to use such agents in the design, 
formulation, synthesis, manufacture, and/or production of a drug or pharmaceutical composition 
for use in diagnosis, prognosis, or treatment, as described herein. For example, in one embodiment, 
the present invention includes a method of synthesizing or producing a drug or pharmaceutical 
composition by reference to the structure and/or properties of a compound obtainable by one of the 
abovc-desenbed screening assays. For example, a drug or pharmaceutical composition can be 
synthesized based on the structure and/or properties of a compound obtained by a method in which 
a cell which expresses a Pro41 2-related target molecule is contacted with a test compound and the 
ability of the test compound to bind to, or modulate the activity of, the Pro412-relatcd target 
molecule is determined. In another exemplary embodiment, the present invention includes a 
method of synthesizing or producing a drug or pharmaceutical composition based on the structure 
and/or properties of a compound obtainable by a method in which a Pro41 2-related protein or 
biologically active portion thereof is contacted with a test compound and the ability of the test 
compound to bind to, or modulate (e.g., stimulate or inhibit) the activity of, the Pro41 2-related 
protein or biologically active portion thereof is determined. 

Pharmaceutical Compositions 

When polypeptides of the present invention are expressed in soluble form, for example as 
a secreted product of transformed yeast or mammalian cells, they can be purified according to 
standard procedures of the art, including steps of ammonium sulfate precipitation, ion exchange 
chromatography, gel filtration, electrophoresis, affinity chromatogrdphy, and/or the like, e.g. 
"Enzyme Purification and Related Techniques," Methods in Enzymology, 22:233-577 (1977), and 
Scopes, R. s Protein Purification: Principles and Practice (Springer-Verlag, New York, 1982) 
provide guidance in such purifications. Likewise, when polypeptides of the invention arc 
expressed in insoluble form, for example as aggregates, inclusion bodies, or the like, they can be 
purified by standard procedures in the art, including separating the inclusion bodies from disrupted 
host cells by centrifugation, solublizing the inclusion bodies with chaotropic and reducing agents, 
diluting the solubilized mixture, and lowering the concentration of chaotropic agent and reducing 
agent so that the polypeptide takes on a biologically active conformation. The latter procedures 
are disclosed in the following references, which are incorporated by reference. Winkler et al. 
Biochemistry, 25: 4041-4045 ( 1 986), Winkler el al, Biotechnology, 3: 992-998 ( 1 985); Koths ct al, 
U.S. patent 4,569,790; and European patent applications 86306917.5 and 86306353.3. 

As used herein "effective amount" means an amount sufficient to ameliorate a symptom of 
an autoimmune condition. The effective amount for a particular patient may vary depending on 
such factors as the state of the condition being treated, the overall health of the patient, method of 
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administration, the severity of side-efTccts, and the like. 

Pro4l2-related proteins can be purified from culture supernatants of mammalian cells 
transiently transfectcd or stably transformed by an expression vector carrying a Pro412-relatcd 
gene. Preferably, Pro412-rclated proteins is purified from culture supernatants of COS 7 cells 

5 transiently transfected by the pcD expression vector. Transfection of COS 7 cells with pcD 
proceeds as follows: One day prior to transfection, approximately 10^ COS 7 monkey cells are 
seeded onto individual 100 mm plates in Duibecco's modified Eagle medium (DME) containing 
10% fetal calf scrum and 2 mM glutamine. To perform the transfection, the medium is aspirated 
from each plate and replaced with 4 ml of DME containing 50 mM Tris.HCl pH 7.4, 400 mgAnl 
• 10 DE AE-Dextran and 50 jig of plasmid DNA. The plates arc incubated for four hours al 37°C, then 
the DNA-containing medium is removed, and the plates are washed twice with 5 ml of serum-free 
DME. DME is added back to the plates which are then incubated for an additional 3 hrs at 37°C. 
The plates arc washed once with DME, after which DME containing 4% fetal calf serum, 2 mM 
glutamine, penicillin (100U/L) and streptomycin (100 ug/L) at standard concentrations is added 

15 The cells are then incubated for 72 hrs at 37°C, after which the growth medium is collected for 
purification of Pro4 1 2 -related proteins. Alternatively, transfection can be accomplished by 
electroporation as described in the examples. Plasmid DNA for the transfections is obtained by 
growing pcD(SRa), or like expression vector, containing the Pro412-related proteins cDNA insert 
in E. coli MC1061, described by Casadaban and Cohen, J Mol. Biol-, Vol. 138, pgs. 179-207 

20 (1980), or like organism. The plasmid DNA is isolated from the cultures by standard techniques, 
eg. Sambrook et at, Molecular Cloning: A Laboratory Manual, Second Edition (Cold Spring 
Harbor Laboratory, New York, 1989) or Ausubel et al (1990, cited above). 

Compounds capable of inhibiting a Pro412-rclatcd activity, preferably small molecules but 
25 also including peptides, Pro412-rclated nucleic acid molecules, Pro412-related proteins, and anti- 
Pro412-rclated antibodies (also referred to herein as "active compounds") of the invention can be 
incorporated into pharmaceutical compositions suitable for administration. Such compositions 
typically comprise a pharmaceutical^ acceptable carrier. As used herein the language 
"pharmaceutical ly acceptable carrier" is intended to include any and all solvents, dispersion media, 
30 coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use of such media and agents for 
pharmaceutical^ active substances is well known in the art. Except insofar as any conventional 
media or agent is incompatible with the active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be incorporated into the compositions. 
35 A pharmaceutical composition of the invention is formulated to be compatible with its 

intended route of administration. Examples of routes of administration include parenteral, e.g., 
intravenous, intradermal, subcutaneous, oral (e g , inhalation), transdermal (topical), transmucpsal, 
and rectal administration. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include the following components: a sterile diluent such as water for 
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injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other 
synthetic solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants 
such as ascorbic acid or sodium bisulfite; chelating agents such as ethylencdiaminetetraacetic acid; 
buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as 
sodium chloride or dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or 
sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes or 
multiple dose vials made of glass or plastic 

Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of 
sterile injectable solutions or dispersion. For intravenous administration, suitable carriers include 
physiological saline, bacteriostatic water, Cremophor EL® (BASF, Parsippany, NJ.) or phosphate 
buffered saline (PBS). In all cases, the composition must be sterile and should be fluid to the extent 
that easy syringability exists. It must be stable under the conditions of manufacture and storage and 
must be preserved against the contaminating action of microorganisms such as bacteria and fungi. 
The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol 
(for example, glycerol, propylene glycol, and liquid polyetheylene glycol, and the like), and 
suitable mixtures thereof The proper fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the required particle size in the case of dispersion 
and by the use of surfactants. Prevention of the action microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. Tn many cases, it will be preferable to include isotonic agents, for 
example, sugars, polyalcohols such as manito), sorbitol, sodium chloride in the composition. 
Prolonged absorption of the injectable compositions can be brought about by including in the 
composition an agent which delays absorption, for example, aluminum monostcarate and gelatin. 

Where the active compound is a protein, peptide or anti-Pro412-relatcd antibody, sterile 
injectable solutions can be prepared by incorporating the active compound (e.g., ) in the required 
amount in an appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating 
the active compound into a sterile vehicle which contains a basic dispersion medium and the 
required other ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are vacuum drying 
and freczc-drying which yields a powder of the active ingredient plus any additional desired 
ingredient from a previously sterile-filtered solution thereof 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
administration, the active compound can be incorporated with excipients and used in the form of 
tablets, troches, or capsules. For administration by inhalation, the compounds arc delivered in the 
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form of an aerosol spray from pressured container or dispenser which contains a suitable 
propel lant, e.g., a gas such as carbon dioxide, or a nebulizer. Systemic administration can also be 
by transmucosal or transdermal means. For transmucosal or transdermal administration, penetrants 
appropriate to the barrier to be permeated are used in the formulation. Such penetrants are 

5 generally known in the art, and include, for example, for transmucosal administration, detergents, 
bile salts, and fusidic acid derivatives. Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For transdermal administration, die active compounds are 
formulated into ointments, salves, gels, or creams as generally known in the art. Most preferably, 
active compound is delivered to a subject by intravenous injection. 

10 In one embodiment, the active compounds arc prepared with carriers that will protect the 

compound against rapid elimination from the body, such as a controlled release formulation, 
including implants and microencapsulated delivery systems. Biodegradable, biocompatible 
polymers can be used, such as ethylene vinyl acetate, poryanhydrides, polyglycolic acid, collagen, 
polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be apparent 

15 to those skilled in the art The materials can also be obtained commercially from Alza Corporation 
and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ acceptable 
carriers. These can be prepared according to methods known to those skilled in the art, for 
example, as described in U.S. Pat No. 4,522,81 1 , the disclosure of which is incorporated herein by 

20 reference in its entirety. 

It is especially advantageous to formulate oral or preferably parenteral compositions in 
dosage unit form for case of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active compound calculated to produce the 

25 desired therapeutic effect in association with the required pharmaceutical carrier. The specification 
for the dosage unit forms of the invention are dictated by and directly dependent on the unique 
characteristics of the active compound and the particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an active compound for the treatment of 
individuals. 

30 Toxicity and therapeutic efficacy of such compounds can be determined by standard 

pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD50 
(the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% 
of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index 
and it can be expressed as the ratio LD50/ED50. Compounds which exhibit large therapeutic 

35 indices are preferred. While compounds that exhibit toxic side effects may be used, care should be 
taken to design a delivery system that targets such compounds to the site of affected tissue in order 
to minimize potential damage to uninfected cells and, thereby, reduce side effects. 

.67 



603^4-576 - O 7 OHCS 



The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies preferably 
within a range of circulating concentrations that include the ED50 with little or NO toxicity. The 
dosage may vary within this range depending upon the dosage form employed and the route of 
administration utilized. For any compound used in the method of the invention, the therapeutically 
effective dose can be estimated initially from cell culture assays. A dose may be formulated in 
animal models to achieve a circulating plasma concentration range that includes the 1C50 (i.e., the 
concentration of the test compound which achieves a half-maximal inhibition of symptoms) as 
determined in cell culture. Such information can be used to more accurately determine useful doses 
in humans. Levels in plasma may be measured, for example, by high performance liquid 
chromatography. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 

Methods of Treatment 

Provided herein arc treatment methods based on the Pro412-related proteins of the invention. Both 
active Pro412-related proteins as provided in SEQ ID NOS 1 to 4, as well as inhibitors of Pro412- 
related proteins such as antibodies may be useful in the treatment or prevention of disease. ' 

Modulation (increasing or decreasing) of Pro4l2-related activity can be carried out in a number of 
suitable ways, several of which have been described in the present application. For example, 
methods of treatment may involve modulating a Pro412Hrclated activity, biological activity of a 
Pro412-related protein or functional activity of a Pro412-related protein. Modulating Pro4l2- 
related activity may involve modulating an association with a Pro412-related-target molecule (e.g. 
a substrate) or preferably any other Pro412-relatcd activity as discussed herein. Preferably 
modulating a Pro4l2-related activity comprises modulating proteolytic activity. 

In one aspect, the Pro4 i 2-relatcd proteins of the invention may be used for stimulating endothelial 
cell growth in a mammal, particularly for the stimulation of angiogenesis. In another aspect, a 
Pro412-related protein may be useful in diagnosing a cardiovascular, angiogenic or endothelial 
disorder in a mammal which comprises analyzing the level of expression of a nucleic acids 
encoding the Pro4 12-related proteins of the invention, or analyzing the level of Pro412-related 
proteins of the invention. 

In one embodiment, vascularization of tumors is attacked using antibodies that bind specifically 
to the Pro412-rclatcd proteins of the invention. The anti- Pro41 2-relatcd antibody and another arc 
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administered to tumor-bearing patients at therapeutically effective doses as determined, for 
example, by observing necrosis of the tumor or its metastatic foci, if any. This therapy is continued 
until such time as no further beneficial effect is observed or clinical examination shows no trace of 
the tumor or any metastatic foci. 

In other aspect, thePro4l2-related protein or agonists or antagonists thereto may also be 
employed for their cell growth stimulating activity to stimulate erythropoiesis or granulopoiesis, to 
stimulate wound healing or tissue regeneration and associated therapies concerned with re-growth 
of tissue, such as connective tissue, skin, bone, cartilage, muscle, liver or kidney, to promote 
angiogencsis, to stimulate growth of endothelial cells, and to proliferate the growth of vascular 
smooth muscle and endothelial cell production. The increase in angiogencsis mediated by the 
Pro412-related protein or antagonist would be beneficial for the treatment of ischemic tissues and 
to collateral coronary development in the heart subsequent to coronary stenosis. 
A Pro412-related protein can be used for its angiogenic activity in peripheral vascular disease, 
hypertension. Inflammatory vasculites, Reynaud's disease and Rcynaud's phenomenon, 
aneurysms, arterial restenosis, wound healing and tissue repair, ischemia reperfusion injury, 
angina, myocardial infarctions such as acute myocardial infarctions, chronic heart conditions, heart 
failure such as congestive heart failure, and osteoporosis. Hypertension for example is 
characterized by raised vascular pressure in the systemic arterial, pulmonary arterial, or 
portalvenous systems. Elevated pressure may result from or result in impaired endothelial function 
and/or vasculardiscasc. In other aspects, antibodies of antagonists to Pro412-related proteins can 
be used for inhibiting endothelial cell growth, and thus for the treatment of chronic inflammatory 
disorders such as rheumatoid arthritis or Crohn's disease 



Pro412-reIated proteins can also be used for stimulating the activity of T-cells, preferably in 
methods comprising contacting CD4- cells or PBMC cells with an effective amount of a Pro4l2- 
relaled protein. In preferred method, the invention comprises administering to the mammal an 
effective amount of a Pro412-relatcd protein of the invention for the treatment of a tumor. In other 
aspects an antagonist of a Pro412-related protein is used for the treatment of an the immune related 
disorder is selected from the group consisting of systemic lupus erythematosis, rheumatoid 
arthritis, osteoarthritis, juvenile chronic arthritis, idiopathic inflammatory myopathies, Sjogren's 
syndrome, autoimmune thrombocytopenia, thyroiditis, diabetes mellirus, immune-mediated renal 
disease, dcmyclinating diseases of the central and peripheral nervous systems such as multiple 
sclerosis, inflammatory bowel disease, psoriasis ,allergic diseases such as asthma, allergic rhinitis, 
atopic dermatitis, and transplantation associated diseases including graft rejection and graft - 
vcrsus-host-discase. 



In yet further aspects, the Pro4 12-related proteins of the invention may be used for promoting the 
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survival of retinal cells. Thus, encompassed are methods of delaying, preventing or rescuing 
photoreceptor cells from injury or death in a mammal comprising administrating a therapeutically 
effective amount of a Pro412- related protein to said mammal. Pro412- related protein can be used 
to delay, prevent or rescue retinal cells (including retinal neurons; e.g., photoreceptor cells, other 
retinal neurons, and supportive cells from injury or death. Other retinal neurons include, but are 
not limited to retinal ganglionitis, displaced retinal ganglion cells, amacrine cells, displaced 
amacrine cells, horizontal and bipolar neurons. Supportive cells include, e.g., Muller cells or RPE 
cells. Pro41 2- related proteins to treat any condition which results in injury or death or 
photoreceptor cells or other retinal cells. Examples of suchconditions include: retinal detachment; 
age-related and other maculopathics; photic retinopathies, surgery-induced retinopathies (either 
mechanically or light-induced), toxic retinopathies including those resultingfrom foreign bodies in 
the eye; diabetic retinopathies; retinopathy of prematurity, viral retinopathies such asCMV or HIV 
retinopathy related to AIDS; uveitis; ischemic retinopathies due to venous or arterial occlusionor 
other vascular disorder, retinopathies due to trauma or penetrating lesions of the eye; peripheral 
vitreoretinopathy; and inherited retinal degenerations. Excmplery retinal degenerations include 
e.g., hereditary spastic paraplegia with retinal degeneration (Kjellin and Bamard-Scholtz 
syndromes), retinitis pigmentosa, Stargardt disease, Usher syndrome (retinitis pigmentosa with 
congenital hearing loss), and Refsum syndrome (retinitis pigmentosa, hereditary hearing loss and 
polyneuropathy). Additional disorders which result in death or retinal neurons include, retinal 
20 tears, detachment of the retina and pigment epithelium, degenerative myopia, acute retinal necrosis 
syndrome (ARN), traumatic chorioretinopathies or contusion (Purtscher's Retinopathy) and edema. 

In view of their role in stimulating growth of skin cells, Pro41 2- related proteins can also be used 
in methods for treating skin disorders. Examples of skin disorders include skin cancers (e.g., 
melanomas), eczema, and psoriasis. Similarly, polypeptides and antibodies directed to these 
polypeptides arcuscful in providing immunological probes for differential identification of 
thesetissue(s) or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the skin, expression of this gene at significantly higher or lower levels may be routinely detected in 
certain tissues (e.g„ skin and cancerous and wounded tissues) or bodily fluids (e.g., semm, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level. 

Diagnostic and Prognostic Uses 

The nucleic acid molecules, proteins, protein homologucs. and antibodies described herein 
can be used in one or more of the following methods, diagnostic assays, prognostic assays, 
monitoring clinical trials, and pharmacogenetics; and in drag screening and methods of treatment 
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(e.g., therapeutic and prophylactic) as further described herein. 

The invention provides diagnostic and prognosttc assays for detecting Pro4 1 2-related 
nucleic acids and proteins* as further described. Also provided are diagnostic and prognostic 
assays for detecting interactions between Pro412-related proteins and Pro4 1 2-related target 
5 molecules, particularly natural inhibitors or cleavage subtrates. 

The isolated nucleic acid molecules of the invention can be used, for example, to detect 
Pro41 2-related proteins mRNA (e.g., in a biological sample) or a genetic alteration in a Pro412- 
related gene, and to modulate a Pro41 2-related activity, as described further below. The Pro412~ 
related proteins can be used to treat disorders characterized by insufficient or excessive production 
10 of a Pro41 2-related protein or Pro4 1 2-related target molecule. In addition, the Pro412-related 
proteins can be used to screen for naturally occurring Pro4 1 2-related target molecules, to screen 
for drugs or compounds which modulate, preferably inhibit Pro412-rclatcd activity, as well as to 
treat disorders characterized by insufficient or excessive production of Pro41 2-related protein or 
production of Pro41 2-related protein forms which have decreased or aberrant activity compared to 
15 Pro4 i 2-related proteins wild type protein. Moreover, the anti-Pro4 Unrelated antibodies of the 

invention can be used to detect and isolate Pro4 1 2-related proteins, regulate the bioavailability of 
Pro41 2-related proteins, and modulate Pro4 1 2-related activity. 

Accordingly one embodiment of the present invention involves a method of use (eg,, a 
diagnostic assay, prognosttc assay, or a prophylactic/therapeutic method of treatment) wherein a 
20 molecule of the present invention (e.g., a Pro41 2-related protein, Pro41 2-related nucleic acid, or 
most preferably a Pro4 1 2-related inhibitor or activator) is used, for example, to diagnose, prognose 
and/or treat a disease and/or condition in which any of the aforementioned Pro41 2-related proteins 
activities is indicated. In another embodiment, the present invention involves a method of use (e.g., 
a diagnostic assay, prognostic assay, or a prophylactic/therapeutic method of treatment) wherein a 
25 molecule of the present invention (e.g., a Pro41 2-related protein, Pro412-rclated nucleic acid, or a 
Pro4 1 2-related inhibitor or activator) is used, for example, for the diagnosis, prognosis, and/or 
treatment of subjects, preferably a human subject, in which any of the aforementioned activities is 
pathologically perturbed. Tn a preferred embodiment, the methods of use (e.g., diagnostic assays, 
prognostic assays, or prophylactic/therapeutic methods of treatment) involve administering to a 
30 subject, preferably a human subject, a molecule of the present invention (e.g., a Pro4 1 2-related 
protein, Pro41 2-related nucleic acid, or a Prc41 2-related inhibitor or activator) for the diagnosis, 
prognosis, and/or therapeutic treatment. In another embodiment, the methods of use (e.g., 
diagnostic assays, prognostic assays, or prophylactic/therapeutic methods of treatment) involve 
administering to a human subject a molecule of the present invention (e.g , a Pro41 2-related 
35 protein, Pro41 2-related nucleic acid, or a Pro4I2-rclatcd inhibitor or activator). 

For example, the invention encompasses a method of determining whether a Pro412- 
related proteins member is expressed within a biological sample comprising: a) contacting said 
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biological sample with: ii) a polynucleotide that hybridizes under stringent conditions to a Pro412- 
related nucleic acid; or iii) a detectable polypeptide (e.g. antibody) that selectively binds to a 
Pro4l2-relatcd protein; and b) detecting the presence or absence of hybridization between said 
polynucleotide and an RNA species within said sample, or the presence or absence of binding of 

5 said detectable polypeptide to a polypeptide within said sample. A detection of said hybridization 
or of said binding indicates that said Pro4l2-related proteins member is expressed within said 
sample. Preferably, the polynucleotide is a primer, and wherein said hybridization is detected by 
detecting the presence of an amplification product comprising said primer sequence, or the 
detectable polypeptide is an antibody. 

10 Also envisioned is a method of determining whether a mammal, preferably human, has an 

elevated or reduced level of expression of a Pro412-related proteins member, comprising: a) 
providing a biological sample from said mammal; and b) comparing the amount of a Pro412- 
related protein or of a Pro4 1 2-related proteins RNA species encoding a Pro4 1 2-related protein 
within said biological sample with a level detected in or expected from a control sample. An 

15 increased amount of said Pro41 2-related protein or said Pro41 2-related proteins RNA species 
within said biological sample compared to said level detected in or expected from said control 
sample indicates that said mammal has an elevated level of Pro412-related proteins expression, and 
a decreased amount of said Pro4l2-relatcd protein or said Pro41 2-related proteins RNA species 
within said biological sample compared to said level detected in or expected from said control 

20 sample indicates that said mammal has a reduced level of expression of a Pro4 1 2-related proteins 
member. 

The present invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, and monitoring clinical trials are used for prognostic (predictive) 
purposes to thereby treat an individual prophylactically. Accordingly, one aspect of the present 

25 invention relates to diagnostic assays for determining Pro412-relatcd protein and/or nucleic acid 
expression as well as Pro41 2-related activity, in the context of a biological sample (e.g., blood, 
serum, cells, tissue) to thereby determine whether an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, associated with aberrant Pro4 1 2-related proteins 
expression or activity. The invention also provides for prognostic (or predictive) assays for 

30 determining whether an individual is at risk of developing a disorder associated with a Pro4 1 2- 
related protein, nucleic acid expression or activity. For example, mutations in a Pro412-rclated 
gene can be assayed in a biological sample. Such assays can be used for prognostic or predictive 
purpose to thereby phophylaclically treat an individual prior to the onset of a disorder 
characterized by or associated with a Pro41 2-related protein, nucleic acid expression or activity. 

35 Accordingly, the methods of the present invention are applicable generally to diseases 

related to regulation of proteolysis, including but not limited to disorders' characterized by 
unwanted cell proliferation or generally aberrant control of differentiation, for example neoplastic 
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or hyperplastic disorders, as well as disorders related to proliferation or lack thereof of endothelial 
cells, inflammatory disorders and neurodegenerative disorders. 

Diagnostic Assays 

5 

An exemplary method for detecting the presence (quantitative or not) or absence of a 
Pro412-relatcd protein or nucleic acid in a biological sample involves obtaining a biological 
sample from a test subject and contacting the biological sample with a compound or an agent 
capable of detecting a Pro41 2-related protein or nucleic acid (e.g., mRNA, Pro4l2-related proteins 

10 genomic DNA) that encodes Pro41 2-related protein such that the presence of the Pro412-relaled 
protein or nucleic acid is detected in the biological sample. A preferred agent for detecting a 
Pro412-related proteins mRNA or genomic DNA is a labeled nucleic acid probe capable of 
hybridizing to a Pro412-reIated proteins mRNA or Pro4 1 2-related proteins genomic DNA. The 
nucleic acid probe can be, for example, a full-length Pro41 2-related nucleic acid, or a probe 

15 sufficient in length to specifically hybridize under stringent conditions to a Pro41 2-related proteins 
mRNA or genomic DNA or a portion of a Pro4l2-relatcd nucleic acid. Other suitable probes for 
use in the diagnostic assays of the invention arc described herein. 

In preferred embodiments, the subject method can he characterized by generally 
comprising detecting, in a tissue sample of the subject (e.g. a human patient), the presence or 

20 absence of a genetic lesion characterized by at least one of (0 a mutation of a gene encoding one of 
the subject Pro4 1 2-related proteins or (ii) the mis-expression of a Pro4l2-relatcd gene. To 
illustrate, such genetic lesions can be detected by ascertaining the existence of at least one of (i) a 
deletion of one or more nucleotides from a Pro41 2-related gene, (ii) an addition of one or more 
nucleotides to such a Pro41 2-related gene, (iii) a substitution of one or more nucleotides of a 

25 Pro4 1 2-related gene, (iv) a gross chromosomal rearrangement or amplification of a Pro4 1 2-related 
gene, (v) a gross alteration in the level of a messenger RNA transcript of a Pro41 2-related gene, 
(vi) aberrant modification of a Pro41 2-related gene, such as of the methylation pattern of the 
genomic DNA, (vii) the presence of a non-wild type splicing pattern of a messenger RNA 
transcript of a Pro41 2-related gene, and (viii) a non-wild type level of a Pro41 2-related target 

30 protein. 

A preferred agent for detecting a Pro412-rclatcd protein is an antibody capable of 
selectively binding to a Pro4 1 2-related protein, preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment 
thereof (e g., Fab or F(ab l ) 2 ) can be used. The term "labeled", with regard to the probe or antibody, 
35 is intended to encompass direct labeling of the probe or antibody by coupling (i.e., physically 

linking) a detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect labeling 
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include detection of a primary antibody using a fluorescentiy labeled secondary antibody and end- 
labeling of a DNA probe with biotin such that it can be detected with fluorescentiy labeled 
slreptavidin. The term "biological sample" is intended to include tissues, cells and biological fluids 
isolated from a subject, as well as tissues, cells and fluids present within a subject. That is, the 

5 detection method of the invention can be used to detect a Pro412-related proteins mRNA, protein, 
or genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro techniques 
for detection of a Pro4 12-related proteins mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of a Pro412-rclatcd protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence. 

10 In vitro techniques for detection of a Pro4 12-related proteins genomic DNA include Southern 
hybridizations. Furthermore, in vivo techniques for detection of a Pro4 12-related protein include 
introducing into a subject a labeled anti-Pro4 12-rclatcd antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be detected by 
standard imaging techniques. 

15 In yet another exemplary embodiment, aberrant methy lation patterns of a Pro4 1 2-related 

nucleic acid can be detected by digesting genomic DNA from a patient sample with one or more 
restriction endonuclcascs that are sensitive to methylation and for which recognition sites exist in 
the Pro4 12-related gene (including in the flanking and intronic sequences). See, for example, 
Buiting et al. (1994) Human Mol Genet 3:893-895. Digested DNA is separated by gel 

20 electrophoresis, and hybridized with probes derived from, for example, genomic or cDNA 

sequences. The methylation status of the Pro4 1 2-related gene can be determined by comparison of 
the restriction pattern generated from the sample DNA with that for a standard of known 
methylation. 

Furthermore, gene constructs such as those described herein can be utilized in diagnostic 
25 assays to determine if a cell's growth or differentiation state is no longer dependent on the function 
of a Pro4 12-related protein, e.g. in determining the phenotypc of a transformed cell. Such 
knowledge can have both prognostic and therapeutic benefits. To illustrate, a sample of cells from 
the tissue can be obtained from a patient and dispersed in appropriate cell culture media, a portion 
of the cells in the sample can be caused to express a recombinant Pro4 12-related protein or a 
30 Pro4 12-related target protein, e.g. by transfection with a expression vector described herein, or to 
increase the expression or activity of an endogenous Pro4 12-related protein or Pro4 12-related 
target protein, and subsequent growth of the cells assessed. The absence of a change in phenotype 
of the cells despite expression of the Pro4 1 2-related proteins or Pro412-related target protein may 
be indicative of a lack of dependence on cell regulatory pathways which includes the Pro4I2- 
35 related proteins or Pro4 12-rclatcd target protein, e.g. Pro4 12-related proteins- or Pro4 12-related 
target-mediated transcription. Depending on the nature of the tissue of interest, the sample can be 
in the form of cells isolated from, for example, a blood sample, an exfoliated cell sample, a fine 
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needle aspirant sample* or a biopsied tissue sample. Where the initial sample is a solid mass, the 
tissue sample can be minced or otherwise dispersed so that cells can be cultured, as is known in the 
art 

In yet another embodiment, a diagnostic assay is provided which detects the ability of a 
Pro4 1 2-related gene product, e.g., isolated from a biopsied tissue, to bind to other cellular, cell 
surface or extracellular proteins. For instance, it will be desirable to detect Pro412-related proteins 
mutants which, while expressed at appreciable levels in the cell, are defective at binding or 
preferably cleaving a Pro412-rclatcd target protein (having either diminished or enhanced 
proteolytic activity or binding affinity for the target). Such mutants may arise, for example, from 
mutations, e.g., point mutants, which may be unpractical to detect by the diagnostic DNA 
sequencing techniques or by the immunoassays described above. The present invention 
accordingly further contemplates diagnostic screening assays which generally comprise cloning 
one or more Pro41 2-related genes from the sample tissue, and expressing the cloned genes under 
conditions which permit detection of an interaction between that recombinant gene product and a 
target protein, or under conditions which permit assessment of proteolytic activity. As will be 
apparent from the description of the various drug screening assays set forth herein, a wide variety 
of techniques can be used to determine the ability of a Pro4 1 2-related protein to bind to other 
components. These techniques can be used to detect mutations in a Pro4 1 2-related gene which give 
rise to mutant proteins with a higher or lower binding affinity for a Pro4 12-related target protein 
relative to the wild-type Pro4 12-related proteins. Conversely, by switching which of the Pro4l2- 
related target protein and Pro4 12-related protein is the "bait" and which is derived from the patient 
sample, the subject assay can also be used to detect Pro4 12-related target protein mutants which 
have a higher or lower binding affinity for a Pro4 12-related protein relative to a wild type form of 
that Pro412-rcIatcd target protein. 

In an exemplary embodiment, a target protein can be provided as an immobilized protein (a 
"target"), such as by use of GST fusion proteins and glutathione treated microtitre plates. A gene (a 
"sample" gene) is amplified from cells of a patient sample, e.g., by PCR, ligated into an expression 
vector, and transformed into an appropriate host cell. The recombinantly produced Pro4 12-related 
protein is then contacted with the immobilized target protein, e.g., as a lysate or a senu-purified 
preparation, the complex washed, and the amount of target protein /Pro4 12-related proteins 
complex determined and compared to a level of wild-type complex formed in a control. Detection 
can be by, for instance, an immunoassay using antibodies against the wild-type form of the Pro4 12- 
related protein, or by virtue of a label provided by clon ing the sample Pro4 1 2-related gene into a 
vector which provides the protein as a fusion protein including a detectable tag. For example, a 
myc epitope can be provided as part of a fusion protein with the sample Pro4 12-related gene. Such 
fusion proteins can, in addition to providing a detectable label, also permit purification of the 
sample Pro4l2-rc!ated protein from the lysate prior to application to the immobilized target In yet 
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another embodiment of the subject screening assay, the two hybrid assay, described in the 
appended examples, can be used to detect mutations in either a Pro4 12-related gene or Pro412- 
related target gene which alter complex formation between those two proteins. 

In one embodiment, die biological sample contains protein molecules from the test subject. 

5 Alternatively, the biological sample can contain mRNA molecules from the test subject or genomic 
DNA molecules from the test subject A preferred biological sample is a serum sample isolated by 
conventional means from a subject. In another embodiment, the methods further involve obtaining 
a control biological sample from a control subject, contacting the control sample with a compound 
or agent capable of detecting a Pro4 12-related protein, mRNA, or genomic DNA, such that the 

10 presence of a Pro412-related protein, mRNA or genomic DNA is detected in the biological sample, 
and comparing the presence of a Pro4 12-related protein, mRNA or genomic DNA in the control 
sample with the presence of a Pro4 12-related protein, mRNA or genomic DNA in the test sample. 
The invention also encompasses kits for detecting the presence of Pro4 12-related protein, mRNA 
or genomic DNA in a biological sample. For example, the kit can comprise a labeled compound or 

15 agent capable of detecting a Pro4 12-related protein or mRNA or genomic DNA in a biological 

sample; means for determining the amount of a Pro4 12-related proteins member in the sample; and 
means for comparing the amount of Pro4 12-related protein in the sample with a standard. The 
compound or agent can be packaged in a suitable container. The kit can further comprise 
instructions for using the kit to detect Pro412-rclatcd protein or nucleic acid. 

20 In certain embodiments, detection involves the use of a probe/primer in a polymerase chain 

reaction (PCR) (see, e.g., U.S. Pat. Nos. 4,683,195 and 4,683,202, the disclosures of which are 
incorporated herein by reference in their entireties), such as anchor PCR or RACE PCR, or, 
alternatively, in a ligation chain reaction (LCR) (see, e.g., Landcgrcn el al. (198S) Science 
241:1077-1080; and Nakazawa ct al. (1994) PNAS 91:360-364, the disclosures of which are 

25 incorporated herein by reference in their entireties), the latter of which can be particularly useful 
for detecting point mutations in the Pro412-rclatcd proteins-gene (see Abravaya et al. (1995) 
Nucleic Acids Res. 23:675-682, the disclosure of which is incorporated herein by reference in its 
entirety). This method can include the steps of collecting a sample of cells from a patient, isolating 
nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic 

30 acid sample with one or more primers which specifically hybridize to a Pro4 12-related gene under 
conditions such that hybridization and amplification of the Pro412-rclated proteins-gene (if 
present) occurs, and detecting the presence or absence of an amplification product, or detecting the 
size of the amplification product and comparing the length to a control sample. It is anticipated that 
PCR and/oT LCR may be desirable to use as a preliminary amplification step in conjunction with 

35 any of the techniques used for detecting mutations described herein. 

Genotyping assays for diagnostics generally require the previous amplification of the DNA 
region carrying the biallehc marker of interest However, ultrasensitive detection methods which 
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do not require amplification are also available. Methods well-known to those skilled in the art that 
can be used to detect bialielic polymorphisms include methods such as, conventional dot blot 
analyzes, single strand conformational polymorphism analysis (SSCP) described by Orita el al., 
PNAS 86 : 2766-2770 (1989), the disclosure of which is incorporated herein by reference in its 

5 entirety, denaturing gradient gel electrophoresis (DGGE), hcteroduplex analysis, mismatch 
cleavage detection, and other conventional techniques as described in Sheffield et al. (199 1), 
White et al. (1992), and Grompe et al. (1989 and 1993) (Sheffield, V.C. et al, Proc. Natl. Acad. 
Sci. U.S.A 49:699-706 (1991); White, M.B. et al., Genomics 12:301-306 (1992); Grompe, M. et 
al., Proc. Natl. Acad. Sci. U.S.A 86:5855-5892 (1989); and Grompe, M. Nature Genetics 5:1 11- 

10 117 (1993), the disclosures of which arc incorporated herein by reference in their entireties). 
Further methods are described as follows. 

In one aspect, the nucleotide present at a polymorphic site can be determined by standard 
sequencing methods. In other examples, microsequencing methods are used, and the nucleotide at 
a polymorphic site in a target DNA is detected by a single nucleotide primer extension reaction as 

15 described by Chen and Kwok (1997) and, Chen and Kwok (Nucleic Acids Research 25:347-353 
1997) and Chen et al. (Proc. Natt. Acad. Sci. USA 94/20 10756-10761,1997), the disclosures of 
which are incorporated herein by reference in their entireties). A preferred method of determining 
the identity of the nucleotide present at an allele involves nucleic acid hybridization. Any 
hybridization assay may be used including Southern hybridization, Northern hybridization, dot blot 

20 hybridization and solid-phase hybridization (see Sambrook et al., Molecular Cloning - A 

Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., 1989), the disclosure of 
which is incorporated herein by reference in its entirety). Hybridization conditions should be 
sufficiently stringent that there is a significant difference in hybridization intensity between alleles, 
and preferably an essentially binary response, whereby a probe hybridizes to only one of the 

25 alleles. Methods and chips of various formats for use in detecting bialielic polymorphisms can be 
produced on a customized basis by Affymetrix (GeneChip), Hyseq (HyChip and HyGnostics), and 
Protogene Laboratories and also described in as described in PCT application Nos. WO 92/10092 
and WO 95/1 1995; US patent No. 5,424,186; and EP 785280, the disclosures of which are 
incoporated herein by reference. 

30 

Having generally described this invention, a further understanding can be obtained by 
reference to certain specific examples which are provided herein for purposes of illustration only, 
and are not intended to be limiting unless otherwise specified 

35 EXAMPLES 
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Example 1: Purification and Characterisation of a Pro4l2-related protein 

Human plasma was fractionnated by a series of multiple chromatography columns. The fractions 
5 were analysed by mass spectrometry. Five tryptic fragments were found to belong to the protein 
sequence of Figure 1 by tandem mass spectrometry: FPGAVDG ATY ILVM VDPDAPSR, 
HWLVTD1K, IQGQELSAYQAPSPPAHSGFHR, YQFFVYLQEGK and V1SLLPK. 

Example 2: Pro412-related proteins possess a PEBP-like functionnal domain 
10 The Pro412-related proteins of the invention display in their C-term part a sequence 

predicted to be folded into an active site like the canonical ones found in the PEBP proteins. This 
structure has been described to form a Hgand binding pocket at one end of the central beta sheet 
(BanGeld et aL, supra). Interestingly, within this Hgand-binding site, the Glu83 in the human PEBP 
protein (highlighted in Figure 2), which has been shown to adopt a relatively rare Cis peptide bond 
15 conformation* is mutated to Phe in the Pro4l2-related proteins of the invention, which is a residue 
not commonly found in a Cis conformation. In the mouse homolog of the Pro412-rclated proteins 
of the invention, this residue is mutated to Tyr (see Figure 3). 

Example 3: Pro4 12-related proteins possess a secretion signal peptide and a conserved, 
20 specifc N-terminus domain 

The N-ter part of the Pro412-rclatcd proteins of the invention is found to correspond to a 
motif conserved across species, which comprises a very clear signal peptide for secretion of the 
protein and a conserved repetition of 4 Cys residues (sec Figure 3). Computer model isation of this 
conserved structure shows that it folds into a stable tertiary structure. 

25 

Example 4: Tissue distribution of Pro4 12-related proteins 

Searches were conducted to establish the tissue distribution for two Pro412-rclated 
proteins of the invention, using the ESTs database dbEST (available at 

http://www.ncbi.nlm jiih.gov/dbEST/). For the protein ofSEQ ID NO:2, the distribution was as 
30 follows: out of 16 total ESTs, 7 were from pectoral muscle tissue (after mastectomy) (accession 
numbers: F20527, F28028, F33888, F36903, F251 13, F26233, F2601 3); two were normal lung 
tissue (BF846558, BM984582) and three from carcinoma lung tissue (AA903899, AA902491, 
A1863957); and finally two were from healthy other tissues (H83927, A 12 18954) and two from 
rumorigenic other tissues (AA854779, BG2 14764). 
35 For the protein of SEQ ID NO.4, the distribution was as follows- out of 8 total ESTs, 3 

were from normal prostate (AA688029, AA661735, AI65975 1), 2 were from carcinoma lung tissue 
(AI857902. AI816715), I was from skeletal muscle (AA192427) and 2 from other tissues 
(BE044451,AW975331). 
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Example 5: Chemical Synthesis of a Pro4 12-related protein 

In this example, a Pro4J2-related protein of the invention is synthesized. Peptide fragment 
5 intermediates are first synthesized and then assembled into the desired polypeptide. 

A Pro412-related protein can ini tally be prepared in, c.g 5 fragments, selected to have a 
Cys residue at the N-terminus of the fragment to be coupled. Fragment 1 is initially coupled to 
fragment 2 to give a first product, then after preparative HPLC purification, the first product is 
coupled to fragment 3 to give a second product After preparative HPLC purification, the second 
10 product is coupled to fragment 4 to give a third product. Finally, after preparative HPLC 

purification, the third product is coupled to fragment 5 to give the desired polypeptide, which is 
purified and refolded. 

Thioester formation. Fragments 2, 3, 4, and 5 are synthesized on a thioestcr generating 
resin, as described above. For this purpose the following resin is prepared: S-acetylthioglycolic 

15 acid pcntafluorophenylester is coupled to a Leu-PAM resin under conditions essentially as 

described by Hackeng et al (1999). In the first case, the resulting resin is used as a starting resin 
for peptide chain elongation on a 0.2 mmol scale after removal of the acetyl protecting group with 
a 30 min treatment with 10% mcrcaptocthanol, 10% piperidine in DMF. The N* of the N-tcrminal 
Cys residues of fragments 2 through 5 are protected by coupling a Boc-thioproline (Boc-SPr, i.e. 

20 Boc-L-thioproline) to the terminus of the respective chains instead of a Cys having conventional 
N*or S° protection, e.g. Brik et al, J. Org. Chem, 65: 3829-3835 (2000). 

Peptide synthesis. Solid-phase synthesis is performed on a custom-modified 433A peptide 
synthesizer from Applied Biosystems, using in situ ncutn»lization/2-(lH-benzotriazol-l-yl)- 
1,1,1,3,3-tetramethyluromum hcxafluoro-phosphate (HBTU) activation protocols for stepwise Boc 

25 chemistry chain elongation, as described by Schnolzer et al, Int. J. Peptide Protein Res., 40: 180- 
193 (1992). Each synthetic cycle consists of N a -Boc -removal by a 1 to 2 min treatment with neat 
TFA, a I -min DMF flow wash, a 10-mm coupling time with 2.0 mmol of preactivatcd Boc-amino 
acid in the presence of excess DIEA and a second DMF flow wash. Na-Boc-amino acids (2 mmol) 
are preactivatcd for 3min with 1.8mmol HBTU (0.5M in DMF) in the presence of excess DIEA 

30 (6mmol). After coupling of Gin residues, a dichloromcthane flow wash is used before and after 
deprotection using TFA, to prevent possible high temperature (TFA/DMF)-catalyzed pyrrolidonc 
carboxylic acid formation Side-chain protected amino acids arc Boc-Arg(p-loluencsu!fonyl)-OH, 
Boc-Asn(xanthyl)-OH, Boc-Asp(0-cyclohexyl)-OH, Boc-Cys(4-methylbcnzyl)-OH, Boc-Glu(0- 
cyclohexyl)-OH, Boc-His(dinitrophcnylbcnzyl)-OH, Boc-Lys(2-Cl-Z)-OH, Boc-Ser(benzyl)-OH, 

35 Boc-Thr(benzyl)-OH, Boc-Trp(formyi)-OH and Boc-Tyr(2-Br-Z)-OH (Orpagcn Pharma, 

Heidelberg, Germany). Other ammo acids arc used without side chain protection. C-tcrminal 
Fragment 1 is synthesized on Boc-Lcu-0-CH 2 -Pam resin (0.71mmol/g of loaded resin), while for 
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Fragments 2 through 5 machine-assisted synthesis is started on the Boc-Xaa-S-CHr-CO-Leu-Pam 
resin. This resin is obtained by the coupling of S-acctyltbioglycolic acid pentafluorophenylcster to 
a Leu-PAM resin under standard conditions. The resulting resin is used as a starting resin for 
peptide chain elongation on a 0.2 mmol scale after removal of the acetyl protecting group with a 
5 30min treatment with 10% mercaptoelhanol, 10% piperidine in DMF. 

After chain assembly is completed, the peptide fragments are deprotected and cleaved 
from the resin by treatment with anhydrous hydrogen fluoride for 1 hr at 0°C with 5% p-cresol as a 
scavenger. In all cases except Fragment 1, the imidazole side chain 2,4-dinitrophenyl (DNP) 
protecting groups remain on His residues because the DNP-removal procedure is incompatible 
10 with Oterminal thiocstcr groups. However DNP is gradually removed by thiols during the ligation 
reaction, yielding unprotected His. After cleavage, peptide fragments are precipitated with ice- 
cold diethylether, dissolved in aqueous acetonitrile and lyophilized. The peptide fragments are 
purified by RP-HPLC with a CI8 column from Waters by using linear gradients of buffer B 
(acetonitile/0. 1% trifluoroacetic acid) in bufFer A (H 2 O/0. 1 % trifluoroacetic acid) and U V 
15 detection at 214nm. Samples are analyzed by electrospray mass spectrometry (ESMS) using an 
Esquire instrument (Briicker, Bremen % Germany), or like iastrument. 

Native chemical ligations. As described more fully below, the ligation of unprotected 
fragments is performed as follows: the dry peptides are dissolved in cquimolar amounts in 6M 
guanidine hydrochloride (GuHCl), 0.2M phosphate, pH 7.5 in order to get a final peptide 
20 concentration of 1 -8 mM at a pH around 7, and 1% benzylmercaptan, 1 % thiophcnol is added. 
Usually, the reaction is carried out overnight and is monitored by HPLC and electrospray mass 
spectrometry. The ligation product is subsequently treated to remove protecting groups still 
present The formyl group of Trp is cleaved by shitting the pll of the solution up to 9.0 with 
hydrazine and incubating for 1 h at 37° C Opening of the N-tcrminal thiazoltdine ring further 
25 required the addition of solid mcthoxamine to a 0.5M final concentration at pH3.5 and a further 
incubation for 2h at 37°C. A 10-fold excess of Tris(2-carboxycthyl)phosphine is added before 
preparative HPLC purification. Fractions containing the polypeptide chain arc identified by ESMS, 
pooled and lyophilized. 

The ligation of fragments 4 and 5 is performed at pH7.0 in 6 M GuHCL The concentration 
30 of each reactant is 8mM, and 1% benzylmercaptan and 1% thiophcnol were added to create a 
reducing environment and to facilitate the ligation reaction An almost quantitative ligation 
reaction is observed after overnight stirring at 37°C. At this point in the reaction, 0-NH 2 .HCl is 
added as a powder to a 0. 1 M final concentration and hydrazine is added to shift the pH to 9.0, for 
the removal of the formyl group of any Trp residues. After a Ih incubation at 37°C, 0-NH 2 HCl is 
35 further added to the solution to gel a 0.5M final concentration, and the pH adjusted to 3.5 in order 
to open the N-iermmal thiazolidine ring. After 2h incubation at 37°C, ESMS is used to confirm the 
completion of the reaction The reaction mixture is subsequently treated with a 10-fold excess of 
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Tris(2-carboxycthylphasphine) over the peptide fragment and after 15min, the ligation product is 
purified using (he preparative HPLC (e.g., C4, 20-60% CH^CN, 0.5% per min), lyophilised, and 
stored at -20°C. 

The same procedure is repeated for the remaining ligations with slight modification. 
5 Whenever the ligation takes place at an I!c-Cys or Val-Cys site, the ligation reaction is extended to 
48h. 

Polypeptide Folding. The full length peptide is refolded by air oxidation by dissolving the 
reduced lyophilized protein (about 0.1 mg/mL) in 1M GuHCl, lOOmMTris, lOmM methionine, 
pH 8.6 After gentle stirring overnight, the protein solution is purified by RP-HPLC as described 
10 above. 



Example 6: Preparation of antibody compositions 



Substantially pure Pro412-rclated protein or a portion thereof is obtained. The 

1 5 concentration of protein in the final preparation is adjusted, for example, by concentration on an 
Amicon filter device, to the level of a few micrograms per ml. Monoclonal or polyclonal 
antibodies to the protein can then be prepared as follows: Monoclonal Antibody Production by 
Hybridoma Fusion Monoclonal antibody to epitopes in the Pro412-related protein or a portion 
thereof can be prepared from murine hybridomas according to the classical method of Kohler and 

20 Milstein (Nature, 256: 495, 1975) or derivative methods thereof (see Harlow and Lane, Antibodies 
A Laboratory Manual, Cold Spring Harbor Laboratory, pp. 53-242, 1988), the disclosure of which 
is incorporated herein by reference in its entirety. 

Briefly, a mouse is repetitively inoculated with a few micrograms of the Pro412~related 
protein or a portion thereof over a period of a few weeks. The mouse 1st then sacrificed, and the 

25 antibody producing cells of the spleen isolated. The spleen cells are fused by means of 

polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by growth 
of the system on selective media comprising aminopterin (HAT media). The successfully fused 
cells arc diluted and aliquots of the dilution placed in wells of a microliter plate where growth of 
the culture is continued. Antibody-Producing clones arc identified by detection of antibody in the 

30 supernatant fluid of the wells by immunoassay procedures, such as ELISA, as originally described 
by Engvall, E., Meth. Enzymol. 70: 419 (1980), the disclosure of which is incorporated herein by 
reference in its entirety. Selected positive clones can be expanded and their monoclonal antibody 
product harvested for use. Detailed procedures for monoclonal antibody production arc described 
in Davis, L. et at. Basic Methods in Molecular Biology Elsevier, New York. Section 21-2, the 

35 disclosure of which is incorporated herein by reference m its entirety. 



Polyclonal Antibody Production by Immunization 
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Polyclonal antiserum containing antibodies to heterogeneous epitopes in the Pro412- 
relatcd protein or a portion thereof can be prepared by immunizing suitable non-human animal 
with the Pro412-reIated protein or a portion thereof, which can be unmodified or modified to 

5 enhance irnmunogenicity. A suitable nonhuman animal, preferably a non-human mammal, is 
selected. For example, the animal may be a mouse, rat, rabbit, goat, or horse. Alternatively, a 
crude protein preparation which, has been enriched for Pro412-related proteins or a portion thereof 
can be used to generate antibodies. Such proteins, fragments or preparations are introduced into the 
non-human mammal in the presence of an appropriate adjuvant (e. g. aluminum hydroxide, R1B1, 

10 etc.) which is known in the art. In addition the protein, fragment or preparation can be pretreated 
with an agent which will increase antigenicity, such agents are known in the art and include, for 
example, methylated bovine serum albumin (mBSA), bovine serum albumin (BSA), Hepatitis B 
surface antigen, and keyhole limpet hemocyanin (KLH). Serum from the immunized animal is 
collected, treated and tested according to known procedures. If the serum contains polyclonal 

15 - antibodies to undesired epitopes, the polyclonal antibodies can be purified by immunoaffinity 
chromatography. 

Effective polyclonal antibody production is affected by many factors related both to the 
antigen and the host species. Also, host animals vary in response to site of inoculations and dose, 
with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng 

20 level) of antigen administered at multiple intradermal sites appears to be most reliable. Techniques 
for producing and processing polyclonal antisera are known in the art, see for example, Mayer and 
Walker (1987), the disclosure of which is incorporated herein by reference in its entirety. An 
effective immunization protocol for rabbits can be found in Vaitukaitis, J. et al. J. Clin. 
Endocrinol. Metab. 33: 988-991 (1971), the disclosure of which is incorporated herein by 

25 reference in its entirety. Booster injections can be given at regular intervals, and antiserum 

harvested when antibody liter thereof, as determined semi-quantitativcly, for example, by double 
immunodiffusion in agar against known concentrations of the antigen, begins to fall. See, for 
example, Ouchtcrlony, O. et al., Chap 19 in: Handbook of Experimental Immunology D. Wier (ed) 
Blackwell (1973). Plateau concentration of antibody is usually in the range of 0. 1 to 0.2 mg/ml of 

30 serum (about 12: M). Affinity of the antisera for the antigen is determined by preparing 

competitive binding curves, as described, for example, by Fisher, D., Chap. 42 in: Manual of 
Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amcr. Soc. For Microbiol.. Washington, 
D. C.(1980). 

Antibody preparations prepared according to either the monoclonal or the polyclonal 
35 protocol are useful in quantitative immunoassays which determine concentrations of antigen- 
bearing substances in biological samples; or they are also used semi-quanlitativcly or qualitatively 
to identify the presence of antigen in a biological sample. The antibodies may also be used in 
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therapeutic compositions for killing cells expressing ihe protein or reducing the levels of the 
protein in the body. 
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CLAIMS 

1. A substantially purified protein having an amino acid sequence of SEQ ID NOS 1 to 4; and 
biologically active portions thereof. 

2. A substantially purified protein consisting of an amino acid sequence of SEQ ID NOS 1 to 4; 
and biologically active portions thereof 

3. A Pro4l2-related protein according to claim 1, wherein said protein comprises at least one 
amino acid deletion, substitution or insertion with respect to said amino acid sequence of claims 1 
or 2. 

4. A substantially purified protein comprising a Pro412-related protein according to claims 1 or 2, 
wherein said Pro412-related protein is fused to another, heterologous protein or amino acid 
sequence. 

5. A method of making a Pro4 1 2-related protein, said method comprising 

providing a population of host cells comprising a recombinant nucleic acid encoding a 
Pro412-rclatcd protein of Claims 1 or 4; and 

culturing said population of host cells under conditions conducive to the expression of said 
recombinant nucleic acid; 

whereby said protein is produced within said population of host cells. 

6. The method of claim 5, further comprising purifying said protein from said population of cells. 

7. An antibody that selectively binds to the protein of Claims 1 to 3. 

8. An antibody according to Claim 7, wherein said antibody is capable of inhibiting binding of said 
protein to a Pro412-related substrate. 

9. A method of determining whether a Pro4 12-rclated protein is expressed within a biological 
sample, said method comprising the steps of: 

(a) contacting a biological sample from a subject with a detectable protein that selectively 
binds to the protein of Claims I or 2; and 

(b) detecting the presence or absence of binding of said detectable protein to a protein within 
said sample; 

wherein a detection of said binding indicates that said Pro412-rclatcd protein is expressed 
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within said sample. 

10. The method of claim H, wherein said detectable protein is an antibody. 

5 LI. A method of determining whether a mammal has an elevated or reduced level of Pro412- 
related protein expression, said method comprising the steps of: 

(a) providing a biological sample from said mammal; and 

(b) comparing the amount of a Pro4 12-related protein of SEQ ID NOS 1 to 4 within said 
biological sample with a level detected in or expected from a control sample ; 

10 wherein an increased amount of said Pro4 12-related protein within said biological sample 

compared to said level detected in or expected from said control sample indicates that said 
mammal has an elevated level of Pro4 12-related protein expression, and wherein a decreased 
amount of said Pro4 12-related protein within said biological sample compared to said level 
detected in said control sample indicates that said mammal has a reduced level of Pro4 12-related 

15 protein expression. 

12. A method of identifying a candidate Pro4 12-related protein modulator, said method 
comprising: 

(a) contacting a Pro4 12-related protein according to Claims I or 2 with a test compound; and 
20 (b) determining whether said compound selectively binds to said protein; 

wherein a determination that said compound selectively binds to said protein indicates that 
said compound is a candidate inhibitor of a Pro4 12-related protein, a candidate Pro412-relatcd 
protein modulator. 

25 13. A method of identifying a candidate Pro4 12-related protein modulator, said method 
comprising: 

(a) contacting said Pro4 12-related protein of Claims 1 or 2 with a test compound; and 

(b) determining whether said compound selectively modulates interaction with a Pro4 12- 
related protein substrate protein, 

30 wherein a determination that said compound selectively modulates said interaction indicates 

that said compound is a candidate Pro412-relaied protein modulator. 

14. A method of identifying a candidate Pro4 12-rclatcd protein modulator, said method 
comprising: 

35 (a) contacting said Pro4I2-rclatcd protein of Claims 1 or 2 with a test compound; and 

(b) determining whether said compound selectively inhibits a biological activity of said 
Pro4 12-related protein, 
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wherein a determination that said compound selectively inhibits a Pro4I2-rcIatcd activity 
indicates that said compound is a candidate Pro4i2-reiated protein inhibitor. 
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Figure 1 

MGWTMRLVTAALLLGLMMVVTG DEDENSPCAHEALLDEDTLFCQGLEVFYPELGNIGCKV 60 

VPDCttNYRQKITSWMEPIV KFPG&VDGATYILV^^ 120 

ADXKEGK IQGQELSAYQAPSPP2U1SGFHKXQET?VYIjQ&G&iFL SUjPKE N KTRGS WKMDRF 180 

LNRFHLGEPEASTQE^TONYODSPTLQAPRERASEP KHKNQAEJA^C 227 
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Figure 2 

pdblbeh VDLSKWSGPLSLQEVDEQPQHPLHVTYAGAAVDELGKVLTPTQVKNRPTS 
pdbla44 VDLSKWSGPLSLQEVDERPQHPLQVKYCGAEVDELGKVLTPTOVKNRPTS 
5 Q9 6S 9 6 E DENS PC AH EALLDEOTLFCQGLEVFY PELGNI GCKWPDCNNYRQKITS 

< beta > < beta > 

pdblbeh ISWDGLDSGKLYTLVLTDPDAPSRKDPKYKESWHHFLWNMKGNDI 

pdbla4 4 ITWDGLDPGK1,YTLVXTDPDAPSRKDPKYREWHHFLVVNMKGNNI 

10 Q96S96 WMEPI VKFPGAVDGAT Y ILVMVDPDAPSRAEPRQRFWRHWI*VT DI KGADL 

<beta> < beta > 

pdblbeh S SGTVLSOYVGSGPPKGTGLHRYVWLVYEQDRPLKCDEPILSNRSG 

pdbla4 4 S SGTVLSDYVGSGPPKGTGLHRYVWLVYEQEGPLKCDEPILSNRSG 

15 Q96S96 KEGKTQGQEIiSAYQAPSPPAHSGFHRYQFFVYLQEGKVISLLPKEN 

< alpha > < beta > 
pdblbeh DHRGKFKV AS FRKK Y ELRAP VAGTC YQAEWDD YV PKLYEQLSG 

pdbla4 4 DHRGKFKVASFRKKYELGAPVAGTCYQAEWDDYVPKLYEQLSG 
20 Q96S96 KTRGSWKMDRFLNRFHLGEPEASTQFMTQNYQDSPTLQAPRER 
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Figure 3 



096S96 
08WW74 
Q9D9G2 
Q9D9L9 



DEDENS PCAHEALLDEDTLFCQGLEV FY PELCN I GfcKVVPDC'NN YRQKIT 
DEDENSPbAHRALLDBDTLFCQGLEVFYPELGNIG0.KWPDpNNYRQKIT 

CFIiPPLPKEDVSTiCRNLEVFYMEMGNISCKIVPKGNLYRQKIP 

"CFLPPLP KEDVSL<3RNI»EVFYMEMGN J. S]GKI VPKCNI.YRQKIT 
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Q96S96 
Q8WW74 
Q9D9G2 
Q9D9L9 



SWMEPI VKFPGAVDGAT Y I LVMVDPDAPSRAE PRQRFWRHWLVTDI KGAD 
SWMEPIVKFPGAVDGATYIIiVMVDPDAPSRAEPRQRjB^RHWLVTDIKGAD 
AWQAPIVKFHTALDGALYlrLVMVDPDAPSRSNPVMKYWRHWLVSNITGAD 
AWQAPI VKFHTALD 
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20 



Q96S96 
Q8WW74 
Q9D9G2 
Q9U9L9 



LKEGKIQGQELSAYQAPSPPAHSGFHRYQFFVYLQEGKVISLLPKENKTR 
LKEGKIQGOELSAYQAPSPPAHSGFHRYQFFVYLQEGKVISLLPKENKTR 
MKSGSIRGNVLSDYSPPTPPPETGVHRYQFFVYLQGDRDISLSVEEKANL 
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Q96S96 
OBWW74 
Q9D9G2 
Q9D9L9 



GSWKMDRFLNRFHLGEPEASTQFMTQNYQDSPTLOAPRERASEPKHKNQA 
GSWKMDRFLNRFHLGEPEASTQFMTQNYQDSPTLQAPRGRASEPKHKTR. 
GGWNLDKFLQQYGLRDPDTSTQFMTQ 
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Q96S96 
Q8WW74 
Q909G2 
Q9D9L9 



EIAAC 
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SEQUENCE LISTING 

<3 10> Coneprot, Inc. 
5 <120> secreted proteins 
<130> 
<160> IB 

<170> Patentln version 3.0 



10 



<210> 1 

<?n> 227 

15 <212> PRT 

<213> Homo Sapiens 

<220> 

<221> misc feature * 

20 <222> (I) .7(227) 

<223> entire protein, 'erase' form 



<220> 

25 <221> SIGNAL 

<222> (1)..(22) 

<223> predicted by SignalP version 2.0 



30 <220> 

<221> SITE 

<222> (211).. (211) 

<223> predicted variation site 



35 



40 



55 



<220> 

<221> misc_feature 

<222> (220) . . (227) 

<223> sequence related to the variation site 211 



<220> 

<221> DOMAIN 
<222> (90).. (112) 
45 <223> matching PROSITE pattern PBP PS01220 

(phosphatidylethanolanune-binding protein) 

<400> 1 

50 Met Gly Trp Thr Met Arg Leu Val Thr Ala Ala Leu Leu Leu Gly T.eu 
3 * 5 10 15 

Met Met Val Val Thr Gly Asp Glu Asp Glu Asn Ser Pro Cys Ala His 
20 25 30 



Glu Ala Leu Leu Asp Glu Asp Thr Leu Phe Cys Gin Gly Leu Glu Val 
35 40 45 



Phe Tyr Pro Glu Leu Gly Asn He Gly Cys Lys Val Val Pro Asp Cys 
60 5 0 5 5 60 

Asn Asn Tyr Arg Gin Lys He Thx Ser Trp Met Glu Pro lie Val Lys 
65 70 75 80 

65 Phe Pro Gly Ala Val Asp Gly Ala Thr Tyr He Leu Val Met Val Asp 
85 90 95 

Pro Asp Ala Pro Ser Arg Ala Glu l J ro Arg Gin Arg Phe Trp Arg His 
100 105 110 

90 



Trp Leu Val Thr Asp lie Lys Gly Ala Asp Leu Lys Glu Gly Lys lie 
115 120 125 

5 Gin Gly Gin Glu Leu Ser Ala Tyr Gin Ala Pro Ser Pro Pro Ala His 
130 135 140 

Ser Gly Phe His Arg Tyr Gin Pbe Phe Val Tyr Leu Gin Glu Gly Lys 
145 150 155 160 

10 

Val lie Ser Leu Leu Pro Lys Glu Asn Lys Thr Arg Gly Ser Txp Lys 
165 170 175 

Met Asp Arg Phe T.eu Asn Arg Phe His Leu Gly Glu Pro Glu Ala Ser 
15 1B0 185 190 

Thr Gin Phe Met Thr Gin Asn Tyr Gin Asp Ser Pro Thr Leu Cln Ala 
195 200 205 

20 Pro Arg Glu Arq Ala Ser Glu Pro Lys His Lys Asn Gin Ala Glu lie 
210 215 220 



25 



30 



35 



40 



Ala Ala Cys 
225 

<210> 2 

<211> 205 

<2J2> PRT 

<213> Homo Sapiens 

<220> 

<221> raise feature 

<222> <1) .7(205) 

<223> mature peptide 

<400> 2 

Asp Glu Asp Glu Asn Ser Pro Cys Ala His Glu Ala Leu Leu Asp Glu 
15 10 15 

Asp Thr Leu Phe Cys Gin Gly Leu Glu Val Phe Tyr Pro Glu Leu Gly 
20 25 30 



Asn He Gly Cys Lys Val Val Pro Asp Cys Asn Asn Tyr Arg Gin Lys 
45 35 40 45 

He Thr Ser Trp Met Glu Pro He Val Lys Phe Pro Gly Ala Val Asp 
50 55 60 

50 Gly Ala Thr Tyr He Leu Val Met Val Asp Pro Asp Ala Pro Ser Arg 
65 70 75 00 

Ala Glu Pro Arg Gin Arg Phe Trp Arg His Trp Leu Val Thr Asp lie 
85 • 90 95 

55 

Lys Gly Ala Asp Leu Lys Glu Gly Lys He Gin Gly Gin Glu Leu Ser 
100 105 110 

Ala Tyr Gin Ala Pro Ser Pro Pro Ala His Ser Gly Phe His Arg Tyr 
60 115 120 125 

Gin Phe L'he Val Tyr Leu Gin Glu Gly Lys Val He Ser Leu Leu Pro 
130 135 140 

65 Lys Glu Asn Lys Thr Arg Gly Ser Trp Lys Met Asp Arg Phe Leu Asn 
145 150 155 160 

Arg Phe His Leu Gly Glu Pro Glu Ala Ser Thr Gin Phe Met Thr Gin 
165 170 175 
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Asn Tyr Gin Asp Ser Pro Thr Leu Gin Ala Pro Arg Glu Arg Ala Ser 
180 185 190 

Glu Pro Lys His Lys Asn Gin Ala Glu lie Ala Ala Cys 

195 200 205 



10 



<210> 

<?n> 

<212> 
<213> 



3 

223 
PRT 

Homo Sapiens 



15 



<220> 
<?.21> 
<222> 
<223> 



nus cofeature 
(1). 7(223) 

entire protein, 'grase 1 form 



<220> 

20 <221> STGNAL 

<222> (1)..(22) 

<223> predicted by SignalP version 2.0 



25 <220> 

<221> SITE 

<222> (211) . . (211) 

<223> predicted variation site 

30 

<220> 

<221> misc_feature 

<222> (220) . . (223) 

<223> sequence related to the variation site 211 

35 

<220> 

<221> DOMAIN 

<222> (90).. (112) 

40 <223> matching PROSITE pattern PBP PS01220 

(phosphat ldylethanolamine-binding protein) 

<400> 3 

45 

Met Giy Trp Thr Met Arg Leu Val Thr Ala Ala Leu Leu Leu Gly Leu 
15 10 15 

Met Met Val Val Thr Gly Asp Glu Asp Glu Asn Ser Pro Cys Ala His 
50 20 25 30 

Glu Ala Leu Leu Asp Glu Asp Thr Leu Phe Cys Gin Gly Leu Glu Val 
35 40 45 

55 Phe Tyr Pro Glu Leu Gly Asn lie Gly Cys Lys Val Val Pro Asp Cy3 
50 55 60 

Asn Asn Tyr Arg Gin Lys He Thr Ser Trp Met Glu Pro lie Val Lys 
65 70 75 80 

60 

Phe Pro Gly Ala Val Asp Gly Ala Thr Tyr lie Leu Val Met Val Asp 
85 90 95 

Pro Asp A La Pro Ser Arg Ala Glu Pro Arg Gin Arg Phe Trp Arq Hts 
65 100 105 110 

Trp Leu Val Thr Asp lie Lys Gly Ala Asp Leu Lys Glu Gly Lys -le 
115 120 125 
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Gin Gly Gin Glu Leu Ser Ala Tyr Gin Ala Pro Ser Pro Pro Ala His 
130 135 140 

Ser Gly Phe His Arg Tyr Gin Phe Phe Val Tyr Leu Gin Glu Gly Lys 
5 145 150 155 160 

Val He Ser Leu Leu Pro Lys Glu Asn Lys Thr Arg Gly Ser Trp Lys 
165 170 175 

10 Met Asp Arg Phe Leu Asn Arg Phe His Leu Gly Glu Pro Glu Ala Ser 
180 185 190 

Thr Gin Phe Met Thr Gin Asn Tyr Gin Asp Ser Pro Thr Leu Gin Ala 
195 200 205 

15 

Pro Arg Gly Arg Ala Ser Glu Pro Lys His Lys Thr Arg Arg Arg 
210 215 220 

<210> 4 

20 <211> 201 

<212> PRT 

<213> Homo Sapiens 

<220> 

25 <??1> mxsc TeaLure 

<222> (1) -7(201) 

<223> mature peptide 

30 <400> 4 

Asp Glu Asp Glu Asn Ser Pro Cys Ala His Glu Ala Leu Leu Asp Glu 
15 10 lb 

35 Asp Thr Leu Phe Cys Gin Gly Leu Glu Val Phe Tyr Pro Glu Leu Gly 
20 25 30 

Asn He Gly Cys Lys Val Val Pro Asp Cys Asn Asn Tyr Arg Gin Lys 
35 40 45 

40 

lie Thr Scr Trp Met Glu Pro He Val Lys Phe Pro Gly Ala Val Asp 
50 55 60 

Gly Ala Thr Tyr He Leu Val Met Val Asp Pro Asp Ala Pro Ser Arg 
45 " 65 70 75 80 

Ala Glu Pro Arg Gin Arg Phe Trp Arg His Trp Leu Val Thr Asp He 
85 90 95 

50 Lys Gly Ala Asp Leu Lys Glu Gly Lys He Gin Gly Gin Glu Leu Scr 
100 105 110 

Ala Tyr Gin Ala Pro Ser Pro Pro Ala His Ser Gly Phe His Arg Tyr 
115 120 125 

55 

Gin Phe Phe Val Tyr Leu Gin Glu Gly Lys Val He Ser Leu Leu Pro 
130 135 140 

Lys Glu Asn Lys Thr Arg Gly Ser Trp Lys Met Asp Arg Phe Leu Asn 
60 145 150 155 160 

Arg Phe His Leu Gly Glu Pro Glu Ala Ser Thr Gin Phe Met Thr Gin 
165 170 175 

65 Asn Tyr Gin Asp Scr Pro Thr Leu Gin Ala Pro Arg Gly Arg Ala Ser 
180 185 190 

Glu Pro Lys Hts Lys Thr Arg Arg Arg 
195 200 
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<210> 
<211> 
<212> 
<213> 



b 

12042 
DNA 

Homo Sapiens 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



<220> 

<221> misc_f eature 
<222> (1) . . (12042) 



10 <223> publxc sequence NT_008300.6 (545299. . 557340 c) 



<220> 
<22X> 
<222> 



exonl 

(1952) . . (2082) 



<223> coding exon 
<220> 

<221> CDS 

<222> (1952) . . (2080) 
<220> 

<221> exon2 

<222> (9356) . - (9482) 

<220> 
<221> CDS 

<222> (9357) . .(9482) 
<400> 5 

ggttcaaatt ccaggtctgc cacttcttgg tgaccaagat 
tgtgctgtct cctcctccat aagatggggt gataataqtg 
Lgaggattcg gagataaaag tgcctggcac gtggagtaag 
gggctcaaca gcaqcaacct gcact-gtggg ctctcccttc 
gctaaaggag ttgacaccca gaaaagttga gacttctcta 
gagtttcttg gtcttgggca gcccttctgg cagcatctcc 
agctgcatgc tcaacctata ccccagggcc caacatgggg 
aatgcctatg agtgaagtgg gttgcaaagg aatgaatgaa 
tacttagoct attcataaat tattgagcat ttactatgtg 
ttaacctggt tgatttcatt ctcataatga tcttgtgagg 
ccactagggg agatcttggg aatatgtttt ctggttgggg 
acttcagaga acctaatqat tgagaaatgt gtttaatatg 
ttgctacagg ttgggatcct ggaggcagga attgagccag 
tagrttctct aaagqtctag ctgcgtgtac tttacagcga 
aggcaccacg gtgacaggaa ttattttact tgtttgtgaa 
agggagacac agtggcctcc cagcatccag agagagctca 
gagttgctgg cacqtct-ctc cracccccgc ctctgagaag 
aagcctgttg caagccgcct ttattcttct ccctgcccac 
cacatgcacc Lgqlqgcagg actgttcccc aagggaatgc 
ccttcctccc agtgcactgc tggggcaggc cacccctttc 



caggttttaa 
cctatttcca 
gaagaaccgc 
ttccaggccg 
gtggatgcca 
ttggctgtgg 
caggcacgtg 
tggatggatc 
ccaggcacct 
aggctactat 
caagggggag 
tagcccctaa 
aaagcagacfc 
cgatctcata 
aaqgaaaagt 
gagcagaagc 
cagagagcag 
tcaggctcaa 
aggcgacagt 
tgagccccaa 



ctaatgtcac 


60 


aggattggca 


120 


cactggaagt 


180 


ccatcactgt 


240 


ILLggaagat 


300 


ctgtcctgtg 


360 


ggaggtgctt 


4 20 


cgtgagagaa 


480 


tactgagggc 


540 


tagactctgt 


600 


tcctggttgt 


660 


ggggggcttt 


720 


atgctgaaag 


780 


ttccttagca 


840 


ggtgcattct 


900 


caggagtcct 


960 


agctattgga 


1020 


accagagtcc 


1080 


ccccggctcc 


1140 


gcagctcccc 


1200 



94 





aaacctcctt 


ctccagatac 




aaaaaotctct aaatacaaca 


cagctgeaag 


1260 




atacacccaa 


acacaccctt 






etteggaget 


1320 


5 


aacctcootc 


tcrqqatacac 


ctacccccac 


cccctocaoc aotccctcct 


acacaaggtc 


1380 




artafat era 


accacctcca 






y ~j ~j *yy ** ^ 


1440 


10 




caggagagtc 


a f? A CCCtCI t" O 


fnannnnpar aofccsuicfcc 


ctaoccctaa 


1500 


gcctggggtc 


accagcttcc 


cagggcttgc 


cttacagcca aggagcaggc 


tggcctttgg 


1560 




ccagctgacc 


atccctgggc 


caacagcagg 


gactggacaa ggagctgeae 


cgaggagctt 


1620 


15 


atataagcac 


ttctactttg 


caqcctqctc 


tctgccctgg cagccaggac 


aagagtgaca 


1680 




acaaagctgt 


gtcctgccag 


gggaggggee 


tetgecagga aegtacattg 


agtccaactt 


1740 


20 


ggaagaggee 


cagtcctccc 


aaagtacttg 


tgtccgggtg gtggactgga 


ttagctgegg 


1800 


agccctggaa 


gctgcctgte 


cttctccctg 


tgcttaacca gaggtctcta 


tcctgccaag 


1860 




tgggcccctg 


gggtcggggg 


cacaggtagt 


gggcaggaag gcccctgcag 


gggatccagc 


1920 


25 


cgctctaaag 


ttctttcfgt 


tccaggtgcc 


c atg ggt tgg aca atg 
Met Gly Trp Thr Met 


agg ctg 
Arg Leu 


1972 



1 5 

gtc aca gca gca ctg tta ctg ggt etc atg atg gtg gtc act gga gac 2020 
30 Val Thr Ala Ala Leu Leu Leu Gly Leu Met Met Val Val Thr Gly Asp 
10 15 20 

gag gat gag aac age ccg tgt gec cat gag gec etc ttg gac gag gac 2068 
Glu Asp Glu Asn Ser Pro Cys Ala His Glu Ala Leu Leu Asp Glu Asp 
35 25 30 35 

acc etc ttt tgc cagtaagtca gggcaggctt ggagqcggqa qqcacagagg 2120 
Thr Leu Phe Cys 
40 

40 

ctaccaggca ggtagatggc 
agectgattt tatgggaagg 
45 tggcactgag aaagggattt 
caggaccagg agggctqtgt 
ctttgtgggg gcaggaggct 

50 

ctgcogcctg aatgccccct 
ggtgtatgat tagacccttc 
55 tttgtgcagg gecctttagg 
cttcccttcc aagggcacaa 
gctctctcct caccaactct 

60 

ggtcatcctg aggacLcccc 
tgaggagggg gacaccccac 
65 tttgactact tactctcttc 
tgtttgtttt t-ct-tfLtaat 
ataaatacat gtcttctctt 



agggaggggt 


atatccagaa 


ccaagaagca 


ccatctcctg 


2180 


cagaacacgt 


gtcagatctg 


gagaccacct 


tgatggagga 


2240 


agaaacagac 


tccttcctgc 


cttccagaat 


gcacctctcc 


2300 


atgtggtcgg 


etctgettgg 


gecctceaaa 


cctggggggt 


2360 


gagaccctca 


gagagagacc 


cagagagaga 


ccatccctcc 


2420 


cLccccagaa 


ccccaacaaa 


gaggcatttg 


aatgaaagaa 


2480 


ctgtgtgaga 


ggtttttqtc 


aqtqatagga 


cccaggagcg 


2540 


gatcatggca 


ctggatggag 


ttgggtgacg 


aggctggctt 


2600 


getgettcta 


ctttgggtaa 


gctggaaagc 


ctggcgttcg 


2660 


geeggttaga 


aaagcaagee 


ttcatttgtg 


tggacttgtg 


2/20 


aggecaaact 


gatgecaaac 


ctccctgcca 


gtttggtggc 


2780 


tgagcccacg 


tgtqagaaat 


qaqaqaqgag 


gaaaaagaac 


2840 


tgtacctttg 


aattttcaca 


aacattactg 


ttcatgaaaa 


2900 


tttatttaaa 


acatttttga 


caaacattac 


ttttagaacc 


2960 


aaaattatgt 


agttgaatcc 


ttctcaaaaa 


gtaacatttt 


3020 
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tgtLgggtga agtggctcac atgtgtaatc 
cacttgagcc caggggtttg agaccagcct 

5 

aaacagttta aaaactagcc aagcatgctg 
ggttgaggtg ggaggatcgc ttaaacccag 
10 gccactgcat tccagcctgg gcaacagagc 
caacaaaaag taacaaactt tgttcgtgtc 
tctgqqtgta ggctgggcnc ggtggctcac 

15 

ggcaggcaga taacaaggtc aggagtttga 
tctctacLaa aaataaaaaa aaattagccg 
20 tactcaggag gctgaggcag gagaatccct 
cgagattgcg ccactgcact ctagcctggg 
aacaaaaaaa caancaagca agaaaacagt 

25 

agagaaacgt ggttgcctct cacaagtggg 
gcLgaggggc tggcaggctt ctcgctctga 
30 gagaggtcag ctgggccagg gtacaqcagg 
gacctcagca gttcctggtg tttcccacac 
tgggggtgac cagccLgagg ctccagatgc 

35 

tggtgccctg gtgacccatg catggggaca 
tgccaggtgc tctgccggtg cctggactcc 
40 acaaagtctg tgttagcacc cccattctgc 
tgcccacagt gtttacatag cttaaaagta 
ctccacactc acgctctttc cctatctgct 

45 

cccagactcg agtcaaggqg gcccottctc 
agtctgagaa agagacttat gaccttaagg 
50 aagccatcat cgcaaaagag gagctttgtc 
gagagcatca tgaatgatga tgaaatgaaa 
gagacagcag gacctcctgc actgacccct 

55 

accttggcaa gagatccccc agagaaggca 
aactcttaaa gattggactc aatgcctaaa 
60 tactgtgaca agctgtgaga cattgttaag 
tgcttctgag ataccccaga caagaccagc 
agtcgcagaa tggaaagaac catgattctc 

65 

aaaaaacccc acctgctgct gccatccagt 
ctgctgctca aagcgtcctg ctttttggtg 



ccagcacttt 


gggaggccaa ggtggaggat 


3080 


gggcaacata gagagactct tttctctaca 


3140 


atgcacacct 


gtcgtcccag ctactcaaga 


3200 


gaggtcaagg ctggtctgag ccatgattgc 


3260 


cagaccctgc 


ctaaacaaca acaacaacaa 


3320 


agtatatttt gaaaaatctt gttaaatagt 


33B0 


acctgtaatc ccaagacttt gggagactga 


3440 


gaccagcctg 


accaacacgg tgaaaccctg 


3500 


ggcatggtgg cgcacacctg taatcccagc 


3560 


tqaatctggg 


aggtggaggt tgcagtgagc 


3620 


cgacagacca 


agactctgtc tcaaaaaaca 


3680 


Ictgggtgaa gttttcacgt ggaaggagga 


3740 


atctaatttt 


ctaggtgctc tgctgagtga 


3800 


gcagtgcaag 


gtcaacaccc ccgtaaccag 


3860 


cctgggccac 


aataagggcc ccttaaggag 


3920 


tgtgggaaat gaggatgtct gtagggtcat 


3980 


cctaggctct 


gccctgccaa gatcacatgt 


4040 


cacgtgggaa 


gtcccagggc accttctgtg 


4100 


ttgactcctt 


tcactgtcag taacgaacct 


4160 


agatgaagaa 


tccccagaga ttaagaaatc 


. 4220 


gcagagccaa 


gatctagccc ccttgtctga 


4280 


tctgacaacc 


ttcaccccag gttcagggct 


4340 


aggactccca 


gaatacacca gcaggccccc 


4400 


ttgcatgcca 


ggaagatgtc ttttgatttc 


4460 


tttattgtca 


ctgaagattt ttgtttggaa 


4520 


aatqcatcaa 


ggacagatcg gacccctgag 


4580 


cctcaagatc 


cccactcggc tcaggctctg 


4640 


ctcctcLtgt 


gggcatttgg tctaccaggg 


4700 


ggtattaatc 


tataagqqaq gaattttatg 


4760 


agtaacgctt 


ctcgggaatc agaccttccc 


4820 


aaaaccaccg 


acggtcccaa ccctgggctc 


4880 


caaacctgca 


gccaggaggt gaatccactc 


4940 


cagccclgca 


agctccccta gggatgggaq 


5000 


ttgaaaagct 


ccgttcttca gaaagtacat 


5060 
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tcttaccccg aggccctgct gctcLtcttg 
ccacgaggtg cacaaactag tctagtttct 
5 tgacaggtgc catgccctgc cctgcccctt 
atccccagtc cttcaacctc tctcaggggc 
ttagagacag agtcttgtct gtcacttagg 

10 

tgcagccttg aacacctggg ctcaagcaat 
actacaggtg cacgccacca cacctggcta 
15 tcggtatgta gtccaggctg ttctcaaact 
ctcccaaagt gccggtatta caggcatgca 
ggggtgaccc acactttcac agtgagcccc 

20 

ctttttgtca tcctcactgt cttagtctgt 
ggtcatttct aaacagcagg aatgtattac 
25 atcaaggtgc taghggattc agtgtgcggt 
ccttcttgct gcgacttcac atgggggaag 
gagggtgcct tttataaggg tgacttaatc 

30 

tcatattgag gattaaattc caacatataa 
agcactcact gaggggcagc tgaaattcta 
35 acgtttctag agacagggtc tcagtgcatc 
gcaggatcat ggctcactgc aaccttgacc 
cctcccatat agctggacta cagatgcgta 

40 

ttatagaqac aggatctcar tacgttgccc 
atcatctcac ctcagcctcc caaagtgctg 
45 ccttgtgatc LaaLtcttgc catgacagag 
cctggaggaa tgaaacagct cactaqqgaa 
aagcccacag tactagttct ctgttgtaca 

50 

aaataggtat tgtccaacag tttctcagtt 
gctgcctcaa ggtatggttg catgaaatga 
55 Lggggtggcc cagtaaaagt catgcccttc 
tggcaaagct ggggccaggg cccctgatct 
tcttgtcaag gagatcagga gatccccaaa 

60 

gtcaggtaaa ccacaagcag caagggcaca 
cttgaatgca agttgttttt gagatagggt 
65 gatgcaatca tggctcactg cagcttctac 
ctgggaacac aggcaggcac tarcatggtg 
tttggtagaa acagggtctc actatgttgc 



aacaccaacc ctacctgttg tagtgcggat 5120 

catcttcttt gtggtcattc atgtgtgtgg 5180 

catccccagc tccccccaac ctgggcagtc 5240 

cttggtctct cctgtttaaa tacatttctt 5300 

ctggagtgca gtggcctgat catagctcac 5360 

cctcctgcct caacgtcctg agtagctggg 5420 

agtttcttat tttttgtaga gttgggagtc 5480 

cctgggctca agcaatcctg ctgcctcagc 5540 

ctgccatacc tggcttgctg tttttcaagt 5600 

ctcttcgagg cagcctcata tcccatgtgt 5660 

tcagctacca taacaaaata ccttaqacta 5720 

tcacaactct ggaggctgag aagcgccaaa 5780 

acaaggttgc tctctgcttc aaagatggca 5840 

ggaagggcag ctcccttcag cctcLcLtat 5900 

acctcttcca agtcccgctt gttaatacca 5960 

attttgagga gacacaaaca ctcagaccat 6020 

agcccttcat acaggcacag ctgagaagcc 6080 

accctctgtc acccaggctg gaatgtagaa 6140 

tcccaqactc aagcgatcct cccacctcag 6200 

ccaccacact tggctaattt tttctatgtt 6260 

aggctggcct ctaactcctg ggctcaagtg 6320 

ggattacagg cttgagccac cacgcccagc 6380 

acccacatga ccacctagct caaactgaag 6440 

aacgtgtggfc gcggggaaga aagtgcagcc 6500 

caacagatca ctccaacacg tagcagctgc 6560 

actcatctgg agataggaat agtgataaca 6620 

aatcatacat atgtgatgtc cccgatatat 6680 

atccccagtc ccttatgtag cattagttca 6740 

tttcactcca ctgtctagag tgccttcaag 6800 

ctctccacat ggaaattaga agaattccat 6860 

caggaaggtg gcacctttga ggttttctgg 6920 

cttqctctgt cacccaggrt ggagtgcggt 6980 

ctcctgggct caagtgatcc tcccaagtag 7040 

tttttttttt gLLtgtttgt ttttttaatt 7100 

ctaggctggt ctcagartcc tgggctcaag 7160 

97 



f- 



Lgalgclccc accccagcct cccaaagagc 
qtcttqaatg caaaatttaa agtgfcatttt 

5 

cctcaggttt gtccaggtcg tctcgtaagt 
gctctgtgcc atctgcagat caaataaatt 
10 aaagcagctg agcagggcag agqcctqcac 
aatacgctat gttcaaacgc tggcctggca 
gacaagaccc aaatccctgg ttccctggca 

15 

gctctgtgcc cacctcatca aataacaatt 
atttacaaca attccactct gataaaaagg 

20 tccaccctca gtttacctcc tccctacctc 
ctgtctctct gcccttggtg ggtcccctct 
aggagttgca gcagaaacaa accaaacggt 
gtcacctaaa acttctccaa aagctgtgaa 
atgcgcaaag agttagaaga aacaaacttg 

30 actctctgng accagagcat gtgacttctg 
ctggtccggg gctgccaggt gctgggggct 
ggagccactg qaagggtggg aggtggaagg 

35 

gaagctgctc ttggcctact cccaggcccc 
tctccaaatt acatcctcac agggtcccct 
40 ccgctgtggg tggctaccgg caggctgtag 
ggctggggac ctgcaacacc cactctctac 
accctcgtta accctacaga ggacttaggt 

45 

caaaaacagt tttoaaaaat gtctttgaaa 
taagcatgca cttacagcca gcctggtgct 
50 gcaggctctg ttgcct/tgct ctgcatcata 
agaaccctgg tqctqttcgg gcccttcttc 
catactgaca gtcttagtga gccagtgttt 

55 

gagaacct.ag gctacagLca atgcttaacc 
gatttcaggc atggggagct cttgattgtt 
60 gqaagctgtg ctgacttgct aagcttttct 
tctataggta ttgtttttat aaactcaatc 
atatctgttc ttctagggaa gtaaaggctg 

65 

ttacddcgtg ccaggttctg ttctgaccat 
catgagggag gaattattag tqLcttcgtt 



tgagattata ggcatgagcc 


accgtgccta 


7220 


ctcagtaact ttaatttttg attcagctca 


7280 


caatttgatc atcagaattg 


ttggttacca 


7340 


tgtctccgtg tccttctcca 


agactttgct 


7400 


atgccccttt cccatcatcc 


taggtggggc 


7460 


gtgccctatg ggcgtgaaga 


gagggacaga 


7520 


tgtgccctgt ccttcagggc actctctgtg 


7580 


tgatgatttt gcttgtttaa 


tccttttgat 


7640 


ggagaggggt tatgggcgtg 


agggcgtccc 


7700 


tcagttttcc catcctgtgc 


tgtttgctgt 


7760 


ctgcctcgca tctcgctagc 


tgcggcagca 


7820 


ccttaagatg atgctaggaa 


gataaaactg 


7880 


aatcttqcca tccaccaaga 


attgctgtaa 


7940 


ctcagcaagc ccatgactca 


tccggcttcc 


8000 


gcactggggg cgctggccca 


ccgaagtgtc 


8060 


cagcggcctt ttgcccaagg ctggacttca 


8120 


tgggaggggc aatgcagggt 


tccccaccag 


8180 


aggttcaggg agctcccttt 


ccagatctat 


8240 


tagatatcac ccccagcaag geaclgLgga 


8300 


agcggcttag ccctgcaagg 


tgttaggaaa 


8360 


ccacaccctt tctctctcag 


tgatgtctta 


8420 


catggagata aattataato 


gaagggcaag 


8480 


aatcttacct gtgttcctga 


gtgtcaaata 


8540 


gtcatggccc agcccggcct 


gccctgccct 


8600 


ttttcttccc ttccaggaag 


ccacattctt 


8660 


tcagggagca aatacatgaa 


ctggttcatt 


8720 


taccccctga gggatgtttg 


cattttgaaa 


8780 


caaagaagcc aattcttcac 


ctaatggtgg 


8840 


tagggqagag agaaggtcta 


tgcccttgaa 


8900 


ttctgtaata agaatgttcc 


tgaaaagcaa 


8960 


ctactttaaa gacactaagt 


ttgcctaagg 


9020 


atgataatta atattatact 


I tctgatgac 


9080 


tgaccgccac ttttcttccc 


acactgactc 


9140 


ttacaggtga ggaaactgag 


gcacaaggag 


9200 
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gtcaagtaac tcttcggtca cacaactagt tagtggcagg tttoggattt gatcccatll 9260 

cactttacat tcccaotaqt gctagactga agccttggga accccaggct cccttaaaac 9320 

5 cctgacacag ggcaacctcc cctctctctt catagg ggc ctt gaa gtt ttc tac 9374 

Gly Leu Glu Val Phe Tyr 
45 

cca gag ttq ggg aac att ggc tgc aag gtt gtt cct gat tgt aac aac 9422 
10 Pro Glu Leu Gly Asn lie Gly Cys Lys Val Val Pro Asp Cys Asn Asn 
50 55 60 65 

tac aga cag aag ate acc tec tgg atg gag ccg ata gtc aag ttc ccg 9470 
Tyr Arg Gin Lys lie Thr Ser Trp Met Glu Pro He Val Lys Phe Pro 
15 70 75 80 

ggg gec gtg gac gtgagtgagc agggctgtgt taaagcgggg ggacagttgg 9522 
Gly Ala Val Asp 
85 

20 

gggtcttgat gttcctccag gettgetcat ctgggtqact tgaagctgcg tgeagageca 9582 

tcLgggttgg gtctggtggc tgcagatggc tgttgccata aagttgtgcc ttccttogag 9642 

25 tggacttgag gagaaggctg ggggctggtg ggagggctgc agactggaaa ctttggtgat 9702 
ggttgaggcc attctgccca ggccttttcc tgctcccctg ctctgggagc tccttggctg 9762 
cccctgagat ctgggaocc:c aagcctggtg gagaccagct ggtgtgtgct ttggagttgg 9822 

30 

cagataagtg gaagaaggcg tggtatqqqa ggagtcctct ctcaaggcgc ggggtgcctg 9882 
ggecttcaga tggcttctgg gctggagaga agctggtatt taccaagaca gagcttccca 9942 

35 agttgcctaa acaggcaqnq gtcagtgcat ettgacgett tgttgtcaca acatgagaca 10002 

cgctcttctt ttgcccaaag ttgtgtaaaa gaagaacaaa gcattttagg aactggaaat 10062 

aactatttta ctgggcagtg attagtgeta ttacatttag gggtctggaa agcctgggga 10122 

40 

agagtcatag aagtgggaaa tgggtctget gctgccctaa geagecagag tgtgttttgg 10182 

ggaagctgat ctggccttct ggtctccgct tctttatttc ttgactcctg ggccagactg 10242 

45 agcccgcaga gtgtcaatgg tggagactgg cctcttttct gctgtgtttt agaaggaggt 10302 

getgetgtea atctaggcag gacttgggtc acaaatgect ggagatggaa gggtcgtggc 10362 

cgggatgtga ccaagcacac acaccaaggc tgaccttctg gtcctttctg tgcagggtgg 10422 

50 

cagggaaggg ccacaagggg ttcttcctlc ccctcagtgc tgccttggtt aggectcatg 10482 

cagtcagcta ageccagtag qacaacaagc tccagagggc gqcagcacac aagttactcc 10542 

55 aatgcgggga atagactctt ctctttgaag tgtttttcac tatcagagag caaatagatt 10602 

gtgatatggc tegcaagtta tgaggctqtt gtcatgaagc tttgctaggt tgtcttgttt 10662 

gcaaacacga caaacaccct ttcctttggg aagacgagca ataaagtgea cacgtgcaca 10722 

60 

tgtgtgcaca tgtgcacaca cacacLLgca tgcacaccca tgtgcacaca gcgtcagccc 10782 

tcttccctgg aaacccttgg actgggcaca cctgtggaca qccaagagga gcagagaaag 10842 

65 gaagectget gattctaact cctgggatgg gaggtgeaga ggccgggcag gaacccaatc 10902 

ttttagagga gtatatgeta etctgeggga caggactctc ggcLLcacca Lccattcact 109G2 

tgtcttgggg tttgtccclt atgtagtgag ggtcagtegg atgetttcac aggecaggga 11022 

99 



cagtcccaag ggaattctcc ccaaagggct ccettccgcg tgctgatgat gctgcagaca 11082 

tccactagat ggcagtgtag acttttetga gcacctggtg tcctccagag cagggatttg 11142 

aagcctgttg ggaaagtctt tggtttgatt tggtttcgtt gccctggctt atcagcacat 11202 

tacacaaaga cgacatcacc cacgccacat cctgaagtct tgtactgtta aaoccatcct 11262 

gcctcagccc gggggtgggg tggggagaaa acggatgtgg aacatagaga tttatgagga 11322 

acacagagct gtggtcaaga aagcctggga gagtgaagat tttacaaagg aagatttgat 11382 

cgaagctgtg ggaagaagaa aaacatagta ttctactact atggaactat agtttatgga 11442 

aatgtaacag acctatgaga aggcagattc agagagattt gccaaagtgg tcagtgagtg llb02 

ccaccttatt ccattcctgg gcactactcg gacccttcat ttaagtctcc aagttaagta 11562 

tttctttcat caagactatt aatggcatgt gacaaaaagt caactcaaat cggcttaagc 11622 

cagaaaagag ggggaaacat tggttcacaa aactgaaaac tccagggata acttcaggcg 11682 

aggctggatc caggtgctca gtgteaccag gaatctgtct cttgattgtt tttttctgtg 11742 

ctgttttcat ttgtaaatag agtcttccct tgtgacaaag atggccccaa cagccttagg 11802 

tttgtgtttg accaatttgt caaccccagt ggaaataggg ttttccatcc attaaaagtc 11862 

ac.iggattgg tctcgctttg gtcttgtgcc tatccctgaa cctagcagtg accctctaca 11922 

gctcccctta atctacaagg actgagagta gaggaggatg gttgccctga atgcaagtck 11982 

aggtgtgatt gatcagataa gggtgaaagg atgttaggta ggaaatgcaa ggcaggtcct 12042 

<210> 6 

<211> 2111 

<212> UNA 

<213> Homo Sapiens 

<220> 

«221> misc_feature 

<222> (l)..(211l) 

<223> public sequence NT_008300.6 (445091. . 447201 c) 
<220> 

<221> exon3 

<222> (1794).. (1892) 

<220> 

<221> CDS 

<222> (1794) .. (1892) 
<400> 6 

aatttccctt tLlaLgatca tctttaatag ctatctctaa cagctatctc caattcctcg 60 
tctcccatct cagctcagcc tattccagcc aaatatcagt cctttctact ccactgaaac 120 
tggtcttatc gtgataacag tgatatccaa gttgtcaaat cagaltgtca catctgtgtc 180 
ctcatctggc cctacctltt agcagcattt gatatagctg atcactttct cttcctggag 240 
acactttcta tcctggcttt tgtgacacct gctttccagg tgtcccccca aactttttgg 300 
ctcLaaagtc ccctttccct gattctcccc tcctctcaat tgtagcatgt tttacagttc 360 
agccatgaga cctcttttcL Lctctgtgtc tgctctactg atttaaaggc catctctata 420 
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tcaatgacta ccaaatttat tcacccagtc ctgatctctc ctctgaatta cagaccgaca 480 

tgcataccta tattctgagg ggcatttcaa actcaagata tctaaaccag aactctgaat 540 

gtccaccttc aaacagcctc ctctccaagt cttatttgta ccatttaatg gcccagttgc 600 

Leaagccaga aacccaggat tcatcctcaa tttctctttc tccatccccc tccttactgc 660 

cgtgccgttg aatctatcag caaaccctgc cgattttacc taaaaatact cccccaaccc 720 

atctatttgt ttctgtctcc atttctgcca tcctagccca agataccaac atccttccca 780 

tgctactgcg agtctcctaa cLggccLcct ggcctcctct ctggcctttg gagatcattc 840 

attttctact cagcagccaa aagaatcctc taaaagcata aatcaqagca tatcactgtc 900 
ctgccccaaa ccctgaaaca gttctgtgtt gcactttgaa taaaacccca ctccctatcc 



gaa ccc aga cag aga ttc tgg aga cat tgg ctg gta aca gat ate aag 
Glu Pro Arg Gin Arg Phe Trp Arg His Trp Leu Val Thr Asp Tie Lys 
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.aggectagag agccccctgt ggcctggcct ctgctccctc ccctgcacct gccatgtcca 1020 

ccctccccct tgcttgtccc tggccacaat ggtgctcttt cttqctcctc aaagectact 1080 

ttLUcccac ctgaagacct tcatagcagc tatttttttc cttccagaac acttcccacc 114 0 

tctcatccca ccctgtttta ttttcatcat agcactactc acacgttccc ttgcttattt 1200 

atttattttt attatctgeg tcttttacct agaacaaaag cttcctgaaa gtatggacct 1260 

tactgtcttg tgctctgttg ccLcctggca cctagagcag cttctggcac acagtaggtg 1320 

ttcagcaaag gcttgttgaa gggatgaatg aatcctggtg ttctttgtga haggtaccgt 1380 

tcttgacaca aaataagtcc tcatatattt gttgaattga taaacaagtc actccaacta 1440 

cactggattc ttcttgagag gaccagagac ctgttttagg tttccttgat tcctccacgg 1500 

caclcagcac agegctcaga acttcccagg tgcttggtat gtatatcttg ttqqatttat 1560 

tcttcctcac aaagaggaag ccaagagtcc tgctgtctgc ttcacagatg ccatttctct 1620 

gagaataaac gtgatgtgag agggatgtca cgagagoang gataaactgg atgggacgcc 1680 

ataggggcta ctcagccatt ctcggttgga atggcgggaa aaggaggtct gccccgtggt 174 0 

45 attttcctat ttcagccaLg aegctacatt actttteggt acatgtgtta cag ggc 1796 

Gly 
1 

n? 9 tu C £ at t} C ? tg gtg atg * tK * gat cca ^at gec cct age aga gca 1844 
Ala Thr Tyr lie Leu Val Met Val Asp Pro Asp Ala Pro Ser Arg Ala 
5 10 15 



1892 

ASD Tip T.v.<* 

25 30 

gtaagcaggc caagaggcac tctctgacgc atttggggtt gtatctggaa gattatcatt 1952 

caqtgcagtt cccacctctc atccaggaag gttggagcga ggacttctgt gtcccatctc 2012 

ccccagtgct cagctatcaa agccctccca agttttccaa caatttcaaa aagtgtccca 2072 

gqgccagtgg tatgctggta acccagcttt ctagaaggg 2111 

<210> 7 

<211> 10041 

<212> DNA 

vPT3> Homo Sapiens 
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<220> 
<221> 
<???> 
<223> 

<220> 
<221> 
<222> 



misc_feature 
(1 ) . . (10041) 

public sequence NT_0083O0.6 (346211. 



.356251 c) 



exon4 
(1408) 



(1453) 



<220> 

<221> misc_feature 
<222> (1423) . . (1423) 
<223> predicted SNP site 

a is replaced by g in seqID 17 (176. 

and in seqID 18 (115.. 115) 



176) 



<220> 
<221> 
<222> 



ex on 5 
(3672) 



(3785) 



<220> 

<221> CDS 

<222> (3674) (3784) 
<400> 7 

agactccaaa aaaagggaag gaaggaagga aggaaggaag gaaggaagga aggaaggaag 
gaaggaagaa agaaagaaag aaaggaagga agaaagaagc aagccagcaa ggaatgaagg 
aaggaaggaa agatctatct cacctgtatc aggtgaagaa gacaaagtac aaggggctcg 
tgggggaagg gtctatcctt ccacagtctc ctgctccttc LLcuaaccaa atgagaccta 
gccctatagg aaccagcctt ggaagaggga tatctgacca agaggcccaa agtgaggaca 
ataatagaca agctgcaqtc taaaaggtag gagatctggt gggttctgga ctgagcgagt 
aggactttag ttctataaaa ggtaaacctc aagctagggg attggatctc tcctgcccct 
gcccccacct caagcagggc ctcccacctc tcccccatcc tcccagcatc ccatgagccc 
ctcttctagg cttggtgaaa aggtagccag ggcacagtta ggtgtcggca gactcaatta 
agagtcacgc taatcacctt cattaaggct ataaaacccc acacctaggg gccttggggc 
tccctggcat- ggcctggggc ccctgagcca tggcacagac ccaggggccc ggctccagtc 
tctttcatct gagcatctcc cagtcctgca caaacagtaa ttagcaccca ccuccggggt 
cacctcatta tcccatagac actgagtgga ggggaggagg gagggaaaga gaggagggaa 
actgaggctg tcccagtggc caaaqaaact gccccagggt tgtgggctga gttaactgag 
ggtggggaga agctgccaca gaatcaggca gtgtagcctc ctcctgcttt cccttccccc 
gctggttgct ctgagcccat tgcttaatat tctaaaggct ccaaagggag cagagtcctc 
cctccagaga agatgggctc cttaqtaaga tccagagaac tgcagaccLa agcgcctccc 
ttagatccga ccagaccctg caggtgatgt gtagcgccca gtctcaggca ccaataggga 
tcgtggaaca gctctttgqq aagtqtccta cagggcctct ctagagggaa ggtgtccatt 
tgcccccctc agcaatacgt gtgtctgtcc tctttccctg gatcccctga ggtacccatt 
ccaaaacccc actcactgcc ctgtctccca aaagcagcca tacatcctcc tgctctccca 
rtttcctcct ctcccatcct acttcttctc ccatcctcct tcctctccca tccLcctcct 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



15 



25 



gcagagagag aagcaatccc gcagagccca cgtggcctga agtgtgccca aqacataacc 1380 

racagccctg cttgtctctt ctttcagggc gccgacctga agaaagggaa gattcagggc 1440 

5 

caggagttat caggtgagta agaccctagg gaccccttcc cccagccagg gcafccagtga 1500 

cgatcacagg aatcaaaacc tcatggtgct gctttgtcgt gtgctagcac cagcctgaga 1560 

10 gttttcttgg atgccctgat ttcatctcca caactgcctt ttatggtcac tgccatcagt 1620 

gtccccattt taagaggata acactgaggc acagaccatc ctgcgccatg tgaatggcta 1680 

gtaaagaatg cagccagctt cacagtgcag ggggccatct gactggaggc tgcttgtaac 1740 

ccctgtccac ccccatcctt cccccagctc ccctctctca gatccctgcc ccaggaggat 1800 

gcccccaacc taccttgaaa gggatgctca ggtactgggc tggcctctgg cacagcactg 1860 

20 ggoaccaaac ttagccctgc agttccgtga agcagcgtgg gcttgggcac caagcaggcc 1920 

tggattcaaa tcccagcacc accactcact gttggcctgg ctttaccLct caagcctcca 1980 

ggaaatggga agatggagga gatgtcataa gtgaaggcct acagcccagg gcctggcccg 2040 

gagaagtggt ctagcatgtg tgaacccttt cccctcctcc tcacgcctca attctctgcc 2100 

tttggaggtt ttgaagaggg gattcttgag gaaagcccta gtgagaggco ttagtctcta 2160 

30 tctcccttga ggagatagag gataaattct ccattgcaca aatccagtga gttctcactt 2220 

cagacttgtc cgcacagaac tgaggtcacc agttttgggc agcgtcaagt gtgcccacgg 2280 

tggagtzgtac tccacagccc acatgcccac acatcttgcc cacactgact tgtcccatct 2340 

gccctccata cctaagcaca cgaggagact ccctggcacc cacctgtgct cagaagtctt 2400 

tctcagccct ggcggtgttt aggaagaaat gctcctttac cLgcatttac acacacacag 24 60 

40 qccctccaaa acagcatcgg ggcgtttgca cactggctgt tcatgtacac acacccaccc 2520 

cgagtgggac ctccaaccac acacatggct ctcagcacac ccagctctgc tcteactctg 2580 

agtgcacatg cgcgtgcaca cacacacaca cacacacaca cacacacata tatctgatcL 2640 

gcgatccaaa aatgcaggga aagatgaatc gtcaggaatt ccccggttgt tggcatgctg 2700 

gaagccacca gtggccggaa gtgcacagtg agggactctt ccagtgctcc aagcagcaag 2760 

50 tcgtgctttt gtttaatgat caggagagga aagaagggga gatggccata aaggaccaat 2820 

actccctccc accaacatcc tcattgcaaa tgcttctttc ccacccaccc agaaagattc 2880 

ttggctccag cctggctctc ctccttggaa aggaagatgg agagcgggaa ggggagtggg 2940 

aaggggtgaa gcacLctggg aaagaggccc accttgaaag acctaggcca ggccagcagg 3000 

accttgactc tgcatggact tqgggaaaga at gagggatg tccccacccc acttcccccg 3060 

60 ccccaaaaaa agagctcagc ctcctggagg gagagagaag gactgggaga gggaacttct 3120 

ccctgotgrr ccgagcaccc agctcctaca gctcccatga tcaaaccttt gaacccagtc 3180 

ccacccaccc ccgtaaccac ttcca'gaccc tccagacqca qccaagaaat tggcttcLt-t 3240 

ggggallqag ttcataagca aatgccgaaa aaataaataa aaatcctctg ccaaacacac 3300 

tcacagagga agatggqctb ctcttgaqgc aggaaalcca tcaggataga ctaaagcaaa 3360 



35 



45 



55 



65 
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10 



15 



aacaggacaa tgccaacacc cactggccca Laaaggcctg ggagccacgc cagcacccaa 3420 

ggccttcaag ccagctcagg gtgtttcctc ttggaccacc tcctccccca ctctcacccc 3480 

caccaaaata tgtacttctt aaatgaaaat actgaagctg agatcccgcc aaatacaaat 3540 

cctaatgttt tgcagggccg agtgctgatc tggggcccca gggaggtcag caggaccatg 3600 

ggaaggcaga gagctctgcc ctcggtagta tgggatgttg atgctcacct ctctcctgct 3660 

tctacctata gcc tac cag get ccc tec cca ccg gca cac agt ggc ttc 3709 
Tyr Gin Ala Pro Ser Pro Pro Ala His Ser Gly Phe 
1 5 10 

cat cgc tac cag ttc ttt gtc tat ctt cag gaa gga aaa gtc ate tct 3757 
His Arg Tyr Gin Phe Phe Val Tyr Leu Gin Glu Gly Lys Val lie Ser 
15 20 25 



etc ctt ccc aag gaa aac aaa act cga ggtaagacct ccccatcctt 3804 
20 Leu Leu Pro Lys Glu Asn Lys Thr Arg 
30 35 

ccagccacca ccggggtgaa ttctggggtg gaacaaacge ttcaggcatt tacggggaaa 3864 

25 ttggggcctt gacccccact catgagcett ggagactggg gtcgaaggct ggtgtttctt 3924 

gggagtgaac tggaaggagt gecctaaaga tgttggcatc aggtgtggtc tcccttgaga 3984 

30 aQtaa 9 tccc acagaccagt gtagaggctc cagttgggtg gyltggtatg taaggcaggg 4044 

tctcaaagca ggtgactgcn gecagggaga agagtacccc gaaactttgc atggcatgtg 4104 

caggcagctg cccctgaact ttggcaagtc attgtgctct gtaattgagt ctgatctggg 4164 

35 caagagggee ctggaatgcc accctgccag tctatcctca tgggttgcta attactgggt 4224 

cggctgtgcc atctcagtga tgaggggaag atgtccccca ccatcccccc tccactagct 4284 

tctctgagca gccctccttg gcttctggag cagaecaact ccctatagcc tcctaatggc 4344 

40 

gaaccccgcc tgcttggcct cggtcctcct gccaccaggg ccaqqgagca ggagcggtgg 4 404 

ccacttggcc actgeagcat gecacaagtg tctcacctga cccagtgtga gccatgcccc 4 4 64 

45 cgactccccc acccgccatc tgagecttag atttctcttg cagaaaatgg gaaataaagc 4 524 

cacccctgca ctctcagctc cccaatttta gggaagggaa gacaatagag actgaacaat 4584 

qccatctgga agtgcaagaa gcacatcccc caggctggca ggaaggcaag tgggagggtg 4 64 4 

ggtgtcgtca gccaLttcct ctcacccagt gacctcgacc aaacatccac ccactaggaa 4704 

gctggcacag gtattgetae aaccgegatg ccggaggccc cacaggagag gaacatggag 4764 

55 cccagaaagg cacaggttga gegacaggge ccttcagatg tgccgagcag ggcccaccta 4824 

tcacagaggg ctccaggcca ccctgggagt ggagctggtg caaacttacc accagtgaac 4804 

cttccccagg tgccctcagc accttgccaa teegtgaate tgtttaaacg getttgecaa 4944 

agacatactt llaagtttag atgeagagag aaggaaacaa aatlatgtat cttgcccaga 5004 

ccaagaaaaa ctgtctccag caccctggct accgatctca ccaaagatgt cqqgacctca 5064 

65 gectatqaca ttctcccacc catggggttc tcataaaatg aagcagtgcc tctgagcccc 5124 

accatgagac teegggatea ccalgcctat gcccagggcg ggagtctctc gggggtgaay 5184 

tgctttcLga aggcaggatg gagggaagga lagagcatcg ggagatgeca tgaggaggga 5244 
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cagacaaggg agaggaaggg agctgcatac 
gggctcctga ggctcgacca cagcctgtcc 

5 

cagaggcctg acccacccct cccaagatgt 
tgggcctgaa acgcgttgtt tggtcttgtg 
10 tttgtccatg gggaacttag aaactggact 
ccaggctggg cgcggtggct cacacctaLa 
ggatcacgag gtcaggagat tgagaccatc 

15 

taaaaataca aaaaaaaaaa aaaattagcc 
ctactccgga ggctgaggca ggagaacagc 
20 ctgagatcgc gccactacac tccagcctgg 
aaaaaaaaaa aaatggccct cccagatgaa 
Igatgccttc catgtgccca gcgctgtgct 

25 

accaggaatt ttccctgtco ccaagtggct 
aatgcaaagc aggttgtaat taaacactta 
30 gaagtctgtg gactcaggga tagatctctt 
actgtgctct ctccaqccag agaggatata 
ggactcccct ctctctagtt agtagtttca 

35 

cctcagtttc ctaatctaga aaataagggg 
gccctgaggc catggggagg aggttggagc 
40 gaatccttga gtgaaggaag agggagctct 
taagtqagaa aatggagggg aaggggaaaa 
ggcttttgca ggttgactct gggaagccgt 

45 

aaaaglcact ctttggagac gacgaacctg 
ggacaggaca ccaccaqaqg ggtcgagaca 
50 gcgattgcaa aagcagaaac cccaggtcct 
gctctctgna gcacctggac ggtgcctggg 
ttgggaggcc agagcatggg qtcctgccaa 

55 

tagatttctc tccaggqgag agaatccgtg 
gggcagtgtq gtataaccag ctgcttgtgt 
60 agagactcga ggggaaaggt cagcctacag 
gggccactga aecceatcct gttcaggcct 
caggcttagg gtgaaqaaqq aggcttgaga 

65 

agggggatgt ggggatggca ttgttcagac 
tctttttact gtgaccacag gacaatgtac 



accaaagacg 


c-ttccctact 


ctgcgtgctg 


5304 


tccagaaacc acagtggcct 


gagggaccag 


5364 


tggtgaccag 


gcaaggaagc 


caaaacgcgc 


5424 


tacttcccca 


ucccacccct 


tgacccacct 


5484 


tgcatctgta cagggtttaa aatggccctc 


5544 


atcccagcac 


tttgggagac 


tgaggtgggc 


5604 


ctggctaaca 


tggtgaaacc 


ctgtctctac 


5664 


cagcgtggcg gtgggcgcct 


atagtcccag 


5724 


algaacctgg 


gaggcggagc 


ttgtagtgag 


5784 


ggacagagag 


aqactccgtc 


tcaaaaaaac 


5844 


agccccagcc 


aaggtcccag 


tcttgccatg 


5904 


gggaacttca 


gaggagacac 


aatcaagggc 


5964 


tacagtctac 


tttggaaaca 


attaaacaac 


6024 


attgtgtgat 


tctaactgta 


aggacaattg 


6084 


cccttcctgt cafcacctttc cctccagaac 


6144 


aaggtgtttt 


attqtagagt 


gaggcaatct 


6204 


cttLgaggcc 


aatcaatgaa 


cctcgctggg 


6264 


ttggctccag atqatctaca 


aatttctctg 


6324 


agggctgggc cctagagaag gaggagatct 


6384 


gggttggaga 


aaccatagat 


tttttttttt 


6444 


acaaggttct 


cagaagtccg 


gcagaggagt 


6504 


tagaaattat 


ttaccagaga 


agcaacactc 


6564 


gaqgggcagg 


cagaactgac 


cagaatgaga 


6624 


tgaggcagtg aggtgtggat 


gaacccaggg 


6684 


tgcacgatgt 


tgaaacactc agggcctggt 


6744 


gacttacgga 


gcccagggct 


ggaccagtgc 


6804 


tgaacacagg 


tcccaggact 


tgccttgatc 


6864 


gaggaagcag 


ggLgtcaggg 


gttgggaggg 


6924 


cccctgqgtg 


aacaggcagg 


aaaagcttct 


6984 


gagccaggta 


gaggggcqtc 


tatggcagac 


7044 


cttccctccc 


tggatct tat 


gggtaaatga 


7104 


tgattctagt 


agagcctgtc atgactatgt 


7164 


ccaaggaggt 


qaaaccctcc 


ccaggattcc . 


7224 


atgtggtcag 


caaatgagat 


cttgatgtcc 


7284 
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cgactctgat cgtttgacct tggccaggtg acctcccctc cctgggtctc aattttgacc 7344 

tcatcaagat acagggttag actagttgat ctgcaattga atgacctgaa attgatttgc 7404 

5 tcccagcttc ccggaatgtc ctggcctccc tgttctgcca aataaatgca gggaccaact 7464 

gatgaggctc gactttccca gccagccctg atacttccta gtagaggtct tgcatcagac 7524 

cccacqaqgc cagqagcaca gcaagggtcg tggaggggct tagagctacc ccacttctag 7584 

10 

ctgatgaaac cctgccgtgc caccccactc ctcctacctt tgcacagctc cttccfcctct 7644 

ggccacLLtc aaaccacgcc tggcataact cagtaatgcc ttctgcattc gctcagctga 7704 

15 tgccagatgc ctagtgtata cttggcgctg ttttqqqtqc tqtaagaagt caccatgtga 7764 

tggtgtccct gctctcatgc atgtgtggag tcaccctggt gtagctctat ggataggggg 7824 

tgctcacfcga gagagtcagg ggagatattc aaacagcaca agcataactg gtccccaaga 7884 

20 

taggaggaac agagagaccc ctgaaatcta aacagcctta actggtgcct ctgacaaggg 7944 

ctggccggcc agcgtcacag gacagctatg tgacgcttgt cccaaaggcc acactttgcc 8004 

25 tttccagatt tqcaqatgcc attatctcag gcctgtgaca tgctgggctg tctccctctg 8064 

attctaagtg aacacccctc tgagggggac atctttcctt ccaggcagat ttcatttgca 8124 

ccttgtacca LLgcccttgt gtgcggcaca ctgccagacg tccctgaact acttctttgc 8184 

30 

tcacatctgc ctgcccacca ctgcccatct gggcg tctca ttctcccntt cccagacccg 8244 

caatgccttg cccactttcc aagcaccagt cccggctgct ggccctccat cacatcctct 8304 

35 agaacooccc cacaaggctg atccgctgag ccaggcttcc cccaacctgc tgttggcatc 8364 

tctagtctct atcgtcttct ccactcacac agcatttgtt gtgcctcccc agacatctgc 8424 

Lttgatctct tttgtatttc taaatcactt gccactcgtg gcggcccccc ttccaccatg 8484 

40 

agactatcag ttccttqaqa qaagcgctca cctot-gcctt ctttcccttc tcccacctgt 8544 

ccctgtgccc agcccactgt tggctttcag ttaatgagtg tgtctgtgtg tgagacacac 8604 

45 ccacafcatgc aca tccacaa gccattcaca cacatggatc tgcacgttct cacaccccca 8664 

catttgcaaa cacacgtcag cttgcctcag cacctqagaa gqcagccacc tct-gtgagta 8724 

ccatgcccac aagtgacagt ctctgcattg atcgacgtcc tccctgtccc aatcccaggc 8784 

aggcagggct gcctctccac cagcctctcc accagccacc ctgcccctcc ctggccttcc 884 4 

ccttgccacc catgtcaagg gggcaggaca gtggccatca cccctacccc agqgcaaacg 8904 

55 aggaaagcaa agcctgaaac agtgagtggc atgacttacc aagggcgtcc acagttaaca 8964 
gccatttgac tttcttctqa cttccagggo ogggctgttt gcacctgcca ccgccttgcc 9024 
tacatccccg ctctcctgac ccagacttct gggccccagt ggggatacag acagagcaca 9084 
gttgcatcct gctgtcccag gcctccLtgg ccagcagcag ctgcccagaa tggtccacgc 9144 
acacaccccc tccctggctg tggctgagcc tcagaatcct ggggagctgc ggtctgtgcc 9204 
65 acaccacacc tcggtcacct cacccagagg tgtcacaagc cagcctcaca ctcctcttgc 9264 
aagcgagcag gggaaatccc ctctctgccc cacgagagga rntgccacag cttgctccag 9324 
aagccactcc aaggccgagc gcccaggatg gtgtcaacgt gctggaaact aacqggcaag 9384 
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10 



15 



20 



agaggcttgg 
ctqcaaaatg 
caaattcggc 
cagggacLLg 
tcggtgggtg 
caagagtagg 
gcagaacaca 
gggccctgcc 
cagacacaaa 
cccagcatcc 
ttcctccaaa 



gtcaccacct 
ggagtcctgq 
agctcagagg 
ccacataggg 
agaaagaagg 
agatgttgct 
gcagcacatg 
tggagaccag 
cacactcagc 
cccagggccc 
gggccccacc 



ccagggcagc 
gccaggcagc 
tggaaccacc 
ggttgaagtc 
ggaatcatgg 
gcagagggat 
gtgtcaggaa 
cccccgccct 
ccaccctcgt 
aqcccctgat 
cacctggccg 



ctcagggccc 
cataagccgc 
cacccacagg 
ttgcttgtgg 
ggccaagggc 
gaggctggga 
gcatggaggg 
cagactgccc 
gacctcccat 
cactgtgtgg 
agcgggtaga 



tcagagggga 
tccctagcac 
ggtcagagag 
catctgaagg 
agcagagccc 
ggaaatccac 
aagaggcctg 
cccaccagcg 
ggtggttcac 
gcttgtggca 
actgggggcc 



aacttgctca 
cacaggcacc 
gagagacggg 
catgcaggcc 
cacqcaqctt 
cagcctagta 
agaggaaagg 
cacacacaga 
ctggcccagc 
ggaggcgctc 
tggcagc 



9444 
9504 
9564 
9624 
9684 
9744 
9804 
9864 
9924 
9984 
10041 
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8 

6241 
DMA 



<213> Homo Sapiens 
<220> 

<221> misc feature 

<222> (1) .7(6241) 

<223> public sequence NTJ)08300.6 (339931 34 6171 c) 
<220> 

<221> exon6 

<222> (5010) (5278) 



<220> 
<221> 
<222> 



CDS 

(5012) . . (5164) 



<220> 

<221> mi 3 cofeature 

<222> (5150) . . (5150) 

<223> insertion-deletion site 

(insertion of a nucleic acid 'a' gives seqID 16) 

<220> - 

<221> 3'UTR 



<222> 
<223> 

<220> 
<221> 
<222> 



(5165) (5278) 
parti al 



polyA_signal 
(5273) (5278) 



<400> 8 

aggcacccag gctgggaact gcctgaaatc tccatgagcc ccaggcggac ctcctcaaga 
agcaagtccc agtccagccg aacatcagcc atcccaagca atgttgaaca catcgtgaag 
aaaccacaca gataagcccc caqattctac cccaaactcc agccaattgt tgccactacc 
aaatattgga atttatcact ggaagctaaa ctggcagctt gaagtctccc atcctcctga 
tccctttcct gccccaaccc cayctgcaga agggaggagg aggcctaggg gatgcactgg 
ctggataatt agaaccagct* tccactcttc agccctcctg ggtcctctct ccagaggaga 
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60 
120 
180 
240 
300 
360 



cgcaggtcac ccctcaccac cccgaaactc tgctggcatc ccctccctct cactcctgcc 420 

aaagtcaaag tgccccccac tacccgaaag cctctggagc ccgcagtatc agccactcag 480 

caaatctcct cccggagtaa ttggatgtga tggaacgagg gcttttcagg tcctgagtgg 540 

aagtctctgg atgtcattta atggcacaaa aaataaaagt ctcaggcctc ccgcctacca 600 

ccgccgggac atcattaatc tcggcagaaa ggtatcagtg cttctatcaa tgtggccggt 660 

cccgtgacga gccagacaaa ttacagacct gagccgggct ctctgtgggg cggttattag 720 

gctggagttg agtttttgtt ttttctcttc agggccacca tcccccaaca agaagcagtg 780 

gtgcccttac ctaagaggag aggcagcgag gccagggaag agggtgatca ccttg^gtag 840 

atacagatca ccacctgaag gaacagggct gttcggagcc ggagccagaa ccagtggccc 900 

ctgtccttac agagaccgcc ctgggacaag aggctgccgc agctccaacc tcaaggtctc 960 

agcaacccac tccttgttct ccagcccacc tgaacggggg cccatgggag aatggagctg 1020 

gyyLgggggg agcctccctc cctcagcttc ctaccccont- tcccaggcta agagggaagt 1080 

acatcaggat ggaaatacat caggacggaa atactgagga ccaaggacca gtcaggtggg 1140 

aaacgtccct tggctacagc agggtcaccc aggaactgct gtcctcggct gttgggaagt 1200 

cttaagagtg ggcctgtcta ggtgcagtga ctcacacccg taatcccagc actttgggag 1260 

gcccaggcag gaggatcact tgggccagga gtttgagatc agcctggaca acatagggag 1320 

accctgtctc tacagataat tttt ttttoa ttacccaggt gtggtagcac acaactgtgg 1380 

ttccaqctac tcaggaggct gaggcaagag gatcacctga gcctgggagg tcaaggctgt 14 40 

aatgagaggt gatagtgcca ctgcacccca gcctggagaa tggtcctgcc gaagggaatc 1500 

caggggtgtc actaggggtg gggaccttgt ctgctaggga cccagactac aaagttctac 1560 

tgctggccaa ggccaccctfc tgaaggacca gactgtgatt aggttagtat tacttgtatt 1620 

atattagaaa ctcgccaagg gcagggggta aaataaggca gctccgctga ccccatgcct 1680 

ggcagaggac cagggtgatg gctgccagtg ttaaaggggc cctggggtct gacctcatct 1740 

aagagcttgc acttctggca ggggagccga tcccccattc taccaagtcc tgcaaggggg 1800 

ccagggtaag ctggatgtgg cagagcacct ctcagtcLct cagatgcctg cttagcccta 1860 

actggcccaa tggagccgtc cagagtttgt ccagcacgtg acaaaccact cccaaacagt 1920 

ggaaagccaa ccccacagat ttgggaagct tggggtggag agagtgactg tcaagacctg 1980 

ccccagggcc ttttccaaac atfccctgcca tcatcocagt ccctgggact aaagctccga 2040 

ggaggagccc cttccccaqc fgctgaagac aggggacatc tttcccccct agagcctaat 2100 

aggaggccag gcacagagtg gtggcccaga aatcccccag tggctttaga taaggttatc 2160 

aggctctccc cttggccccc atctggaaca tcgctgaaat ctaacatctc tcatcactaa 2220 

catctttcct atcagcactq accagtaagt gagagtgcca ggactgcLqc atctggctat 2280 

gcaggatgag aacagcaccc cctacggttg tgcagggaLc aacctgtaca tcatatgtgg 2340 

cagccctgtc ctgagatgcc caggcctctg tgtcccagcc acagggcagg tqgctcatga 2400 
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gacaggcatc agacagcttc ctcctccttt 
ggcctctqaa gtgcccacac tgccoctacc 

5 atccaggata cagtggagtg gcccaagcac 
ccgcggactc tctgcctctc ggggacagca 
cqqtgaccac agaagccgca gcctcctagc 
ctgactcacc ccctctgcct tctagaactt 
ggagttgcgg ccaagccagg aaaggaggca 

15 ttcaggagct ggccagggag cccgagaacc 
accacgagtg ccccagggca ctgaaggctc 
ctctcgctcfc ggnctcaggg gatttcttcc 

20 

tcaattccct tcctgccgtg caggttcctt 
ccctgttcct gccacacacc aattcaccca 
25 gcgcacacac acgtgcaccc acacatacac 
ataataccat aattcctttc cttccagcca 
acagcccctc tgaaaggaca ctcaggLgac 

30 

tcaqtacacc tccagcggga gaggctttct 
actcctggaa ccccttgttt tccagccccc 
35 gtcgaatagg atttgaggga atctgcaaag 
tccttgccag taccccagct tgtgcctcag 
agtccLcccL ttttgggatt ggtgtgagga 

40 

tagtaccaac acataggtgg qqtgttactg 
ttttgggttt cagattaaaa attatcctct 
45 ctctgggagg cagcaacgcL Lcaatgggct 
gggtgctgac cgaggagaca ggtaggtggg 
gttctgagag cttccaaccc cttgccctct 

50 

cattaatcca ctgtgtatcg accactgc.ct 
cactctctct ccaccctggg aggttgctgc 
55 tagqaLgccc agtaaaattt gaattccaga 
atqctccatg cagaatgtgg gacacgctca 
aLtcaaatta aactacttcc tgccttttcc 

60 

ccagggtttg ggaaacaaga acgtgggaal 
ccatgcccca ggctgtgccc ttctcgccct 
65 agatqcgcct ccctgtacac acttgtgttt 
gggcaccgtc tgggtttctg caggcccaqr 
caccLclgcc atggggaagg cacactggtg 



ctcccttcag ccaaggcctc ccaccaccaa 2460 

ctcgtcccag gaagagcctg gcctaggtgc 2520 

agacagcagc gctgcccttt tcctgcctgg 2580 

gtgccataca ggaaggggct ttcctctgag 2640 

tctccagctc tccccacaca ccccaagggc 2700 

gccctctgtg gcaggaaaca gggggtgtaa 2760 

gcaagaggct agccggcacg gggactgggt 2820 

acagggctgt cgacttggga ctggaaagga 2880 

agaggcttcc gaactattga ggggagccag 294 0 

ccgcagcttg ccccagcctc gtgatgggtt 3000 

cctgccctac ggtggggaag gccccctcgt 3060 

taagcccgta ttaccctggc atgtgcgcgt 3120 

atgtgtgtgc acacactcgt tcccagggtc 3180 

gctgggctcc tagggtgcag aggctcaggc 3240 

actgagcctc ctaggagaac ccggctgcag 3300 

cagcaccttg qccacccttt ctgcaaaaga 3360 

ctgggcccca tctgagcatg cagtcactga 3420 

gcctttcccc atccctgcca ctacgaagct 3480 

tcatcccttc tgtgaactgg gaatgggaac 354 0 

ttaagcgaat tatgtgtcaa gctcttacag 3600 

ctattnttgt ttgcattatt gttattgtta 3660 

ccttgagaac atctttgaaa tgtcagcagc 3720 

tgctatctta attgctatga aaagggtggt 3780 

agggtgggtg gcccqtqagt ctcaagtggg 3840 

tctgtctggg tatcagaggt gactggtctc 3900 

agctctgctc cggauagggL cttccccagg 3960 
atttagcaaa acaacaacaa caacaaaaag 4020 
taaacaacgg aaaaaaaaat tcatgtaagg 4080 
caccaacaag cgatttgcta tttatctgag 4140 
tggcaagcat ctgcccaggg tgcctqacat 4200 
algccLctga ccacttccct cagcctcact 4260 
gccacaaggc ctgttcacac ttgccccaga 4320 
tctccccctg aggaggaacc cctccgacct 4380 
tcccctigcca gggtccaggg gctgccccLc 4440 
caccccctgt cctgggacag agacqaqqaq 4 500 
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gtgtttgggg tttaggggag aagattgaaa gaagagcaca gatacttcca ccaatctcag 4560 

ccccagcagc aggggtgagc aggtgtgaag gaaaqtagcg ggtgaagcct cctgtcctgg 4 620 

ggtcagggga gcctgcaggg agagggcatt agtagaggaa gtgggcagcc cgcttgactg 4 680 

ggcttccttt ctgcccggga gcgagaaggc agttttgcta aacagaggtg ctggggaagg 4740 

10 agattcagac attctcaact ggggctgggc tctcagtccc cattgccctc tggggagcag 4800 

gcagctctgc tgctgltcag aggctctgct aacatctggg ggctacctct gcaacaggag 4860 

ccggaagctc ccccagccac tacgccccat ggccatcgca tcacctgctg cctcggctcc 4 920 

15 

cacgaqccgg cctccctctc aggcctctgc cagctttcaa gtccatggct ctttctcact 4980 

gcctccctgt ggtctgtttg tccttccagg c tct tgg aaa atg gac aga ttt 503? 

Sor Trp Lys Met Asp Arg Phe 
20 15 

ctg aac cgc ttc cac ctg ggc gaa cct gaa gca age acc cag ttc atg 5080 
Leu Asn Arg Phe His Leu Gly Glu Pro Glu Ala Ser Thr Gin Phe Met 
Z0 15 20 

25 

acc cag aac tac cag gac tea cca acc etc cag get ccc aga gga agg 5128 
Thr Gin Asn Tyr Gin Asp Ser Pro Thr Leu Gin Ala Pro Arg Gly Arg 
25 30 35 

30 gcc age gag ccc aag cac aaa acc agg egg aga tag ctgcctgcta 5174 
Ala Ser Glu Pro Lys His Lys Thr Arg Arg Arg 
40 45 50 

gatagcegge tttgccatcc gggcalgtgg ccacactgct caccaccgac gatgtgggta 5234 

35 

tggaaccccc tctggataca gaaccccttc ttttccaaat taaaaaaaaa aatcatccag 5294 

ggcttggtgc tttgtatcct agcttgtatt cctttgaaaa gcaacaggag gaaattggcc 5354 

40 aatatccagc actggggact gaggctttcc tglaaatgca aggggttggg tggggagtgg 5414 

ggqqaataga eggacaggea ggatatagtg atccctacgc ctgaactgtc actgagtege 5474 

tgtgagacct cacggaaacc Lcttaccctc cccaaagttc tgctcccttg gttgtgacac 5534 

45 

ggaggtagga agaagtggga gagggtggtt ggtttatcac cacgggcctt tccagctctg 5594 

acattcqaca aegttcatte agegactatt gttgcatgtg gggtgcacct ggccttcctc 5654 

50 caggcactgg agatgeatge ctgcLLcatc cccatgcatg ggtagatagg gcagacacog 5714 

cotggtgcfcc acctcatgga gcttccagtc cagcaqqgga agcaaatatc cccaacaaat 5774 

gataaactgt gataagtatg atgagggtgc aaagggggee tgatttctga eggagagaaa 5834 

55 

gggttgaaga ctcaggcaga gecctgatgt ftaactlcag ggctgaaaag tgqgctaggg 5894 

gttaccaggt gagtggagac aggaatagag aacattccaa agctcctccg agcccctgtg 5954 

60 gctggcaggg geatgeatgg catcLLccag gaaatgaaag gcccatgctg tggclgccgc 6014 

aaaglatgaa agggcaagaa cagtatgacg gqtgggcaaa ggccagacca tgccaggcct 6074 

ttgttgccat gttaaagctc tggattgtat ccagaaagca gtgggaaatc attaaatgat 6134 

65 

aagtagattg ggecaggcat ggtggctcgt gcctataalc ccagtgcttt ggqaggccaa 6194 

qqccagagga tegcttgaag ccaaqagttc aagaccagcc tgggcag 6241 



110 



M39^S76 .D7QDOS 
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<210> 9 

<211> 14334 

<212> DNA 

5 <213> Homo Sapiens 

<220> 

<221> misc_feature 

<222> <1) .7(14334) 

10 <??3> public sequence AC105046.2 (20000 34333 c) 

<220> 

<221> exonS 

<222> (1023).. (1136) 

<220> 

<221> CDS 

<222> (1025) . . (1135) 

20 <220> 

<221> exon6 

<222> (12443) . . (12711) 

<223> seqID 1 and seqID 2 

25 <220> 

<2?1> CDS 

<222> (12445) . . (12609) 

<220> 

30 <221> 3'UTR 

<222> (12610) . . (12711) 

<223> partxal 

<220> 

35 <221> polyA_signal 

<222> (12706) . . (12711) 



40 



50 



60 



<400> 9 

aattgtaata cgactcacta lagggcgaat tccccggttg ttggcatgct ggaagccacc 60 



agLggccgga agtgcacagt gagggactct tccaqtgctc naagcagcaa gtcgtgcttt 120 

tgtttaatga tcaqqagagg aaagaagggg agatggccat aaaggaccaa tactccctcc 180 

45 caccaacatc ctcattgcaa atgcttcttt cccacccacc cagaaagatt cttggctcca 24 0 

gcctggctct cctccttgga aaggaagatg gagagcggga aggggagtgg gaaggggtga 300 

agcactctgg gaaagaggcc caccttgaaa gacctaggcc aggccagcag gaccttgact 360 

ctgcatggac ttggggaaag aatgagggat gtccccnccc cacttccccc gccccaaaaa 420 

aagagctcag cctcctggag ggagagagaa ggactgggaq agggaacttc tccctqatgc 4 80 

55 cccgagcacc cagctcctac agctcccatg atcaaacctt tgaacccagt cccacccacc 540 

rtcgtaacca cLLccagacc ctccagacgc agccaagaaa ttqgcttctt fcggggattga 600 

gttcataagc aaatqccgaa aaaataaata aaaatcctct gccaaacaca ctcacagagg 660 

aagatgggct tctcttgagg caggaaatcc atcagydtdq actaaagcaa aaacaggaca 720 

atgccoacac ccactggccc ataaaggcct gggagccacg ccagcaccca aggccttcaa 780 

65 gccagctcag ggtgtttcct cttggaccac ctcctccccc actctcaccc ccaccaaaat 840 

atgtacLLct caaatgaaaa tactgaagct gagatcccgc caaatacaaa tcctaatgLL 900 

ttgcagggcc gaqt qctgot ctggggcccc agggaggtca gcaggaccat gggaaggcag 960 
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agagctctgc cctcgglagt atgggatgtt gatgctcacc tctctcctgc ttctacctat 1020 

agcc tac cag get ccc tec cca ccg gca cac agt ggc fctc cat. cgc tac 1069 
5 Tyr Gin Ala Pro Ser Pro Pro Ala His Ser Gly Phe His Arg Tyr 

15 10 15 

cag ttc ttt gtc tat cLL cag gaa gga aaa gtc ate tct etc ctt ccc 1117 
Gin Phe Phe Val Tyr Leu Gin Glu Gly Lys Val He Ser Leu Leu Pro 
10 20 25 30 

aag gaa aac aaa act cga ggtaagacct ccccatcctt ccagccacca 1165 
Lys Glu Asn Lys Thr Arg 
35 



15 




ttctqqgqtq 


gaacaaaege 


ttcaggcatt tacggggaaa ttggggcctt 


1225 




gacccccact 


catgagcett 


qqaqactaqa 


gtcgaaggct ggtgtttctt gggagtgaac 


1285 


20 


taaaaaaaot 


gecctaaaga 


tgttggcatc 


aggtgtggtc tcccttgaga agtaagtccc 


1345 




acagaccagt 


otaaaacrctc 


caottqaqtq 


ggttggtatg taaggcaggg tctcaaagca 


1405 


25 


era t aac t oca 


qccaqqqaqa 


agagtacccc 


gaaactttgc atggcatgtg caggcagctg 


14 65 


cccctgaact 


ttggcaagtc 


attgtgctct 


gtaattgagt ctgatctqqg caagagggee 


1525 




ctggaatgcc 


accctgccag 


tctatcctca 


tgggttgcta attactgggt cggctgtgcc 


1585 


30 


atctcagtga 


tqaqqqqaaq 


atgtccccca 


ccatcccccc tccactagct tctctgagca 


1645 




gccctccttg 


gcttctggag 


cagaccaact 


ccctatagcc tcctaatggc gaaccccgcc 


1705 


• 

35 


tgcttgyect 


cggtcctcct 


gccaccaggg 


ccagggagca ggagcggtgg ccacttggcc 


1765 


actgeagcat 


gecacaagtg 


tctcacctga 


cccagtgtga gccatgcccc cgactccccc 


1825 




acccgccatc 


tgagecttag 


atttctcttg 


cagaaaatgg gaaataaagc cacccctgca 


1885 








yyydciyyyclci 


yov^oci Lcjy ay aULC)dal.cict(> y^L.ovuuyya 


1945 




agtgcaagaa 


gcacatcccc 


caggctggca 


ggaaggcaaq tgggagggtg ggtgtcgtca 


2005 


45 


gccatttcct 


ctcacccagt 


gacctcgacc 


aaacatccac ccactaggaa gctggcacag 


2065 


gtattgetae 


aaccgegotg 


ccggaggccc 


cacaggagag gaacatggag cccagaaagg 


2125 




cacaggttga 


gegacaggge 


ccttcagatg 


tgccgagcag ggcccaccta tcacagaggg 


21B5 


50 


ctccdtjqcoa 


ecctgggagt 


ggagctggtg 


caaacttacc accagtgaac cttccccagg 


2245 




tgccctcagc 


accttgccaa 


teegtgaate 


tgtttaaacg getttgecaa agacatactt 


2305 


55 


ttaagtttag 


atgeagagag 


aaggaaacaa 


aattatgtat cttgcccaga ccaagaaaaa 


2365 


ctgtctccag 


raccct* ggct 


accgatcLua 


ccaaagatgt egggacctca gectatgaca 


2425 




ttctcccacc 


catggggttc 


tcataaaatg 


aagcagtgcc tctgagcccc accatgagac 


2485 


60 


tccgggaLca 


ccatgcctat 


gcccagggcg 


ggagtctctc gggggtgaag tgctttctga 


2545 




aggcaggatg 


gagggaagga 


tagagcatcg 


ggagatgeca tgaggaggga cagacaaggg 


2605 


65 


agaggaaggg 


agetgeatae 


accaaagacg 


cttccctact ctgcgtgctg gggctcctga 


2665 


ggctcgacca 


rrfgcctgLcc 


Lcuagaaacc 


acagtggcct gagggaccag cagaggectg 


2725 




acccacccct 


cccaagatgt 


tggtgaccag 


gcaaggaagc caaaacqcgc tgggcctgaa 


2785 
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acgcgttgtt tggtcttgtg tacttcccca 
gggaacttag aaactggact tgcatctgta 
5 cqcggtggct cacacctata atcccagcac 
gtcaggagat tgogaccatc ctggctaaca 
aaaaaaaaaa aaaattagcc cagcgtggcg 

10 

ggctgaqqca ggagaacagc atgaacctgg 
gccactacac tccagcctgg ggacagagag 
15 aaatggccct cccagatgaa agccccagcc 
catgtgccca gcgctgtgct yggaacttca 
ttccctgtcc ccaagtggct tacagtctac 

20 

aggttgtaat taaacactta attgtgtgat 
gactcaggga tagatctctt cccttcctgt 
25 ctccagccag agaggatata aaggtgtttt 
ctctctagtt agtagtttca ctttgaggcc 
ctaatctaga aaataagggg ttggctccag 

30 

catggggagg aggttggagc agggctgggc 
gtgaaggaag agggagctct gggttggaga 
35 aatggagggg aaggggaaaa acaaggttct 
ggttgactct gggaagccgt tagaaattat 
ctttggagac gacgaacctg gaggggcagg 

40 

ccaccagagg ggtcgagaca tgaggcagtg 
aagcagaaac cccaggtcct tgcacgatgt 
45 gcacctggac ggtgcctggg gacttacgga 
agagcatggg gtcctgccaa tgaacacagg 
tccaggggag agaatccgtg gaggaagcag 

50 

glalaaccag ctgcttgtgt cccctgggtg 
ggggaaaggt cagcctacag gagccaggta 
55 accccatcct gttcaggcct cttccctccc 
gtgaagaagg aggcttgaga tgattctagt 
ggggatggca ttgttcaqac ccaaggaggt 

60 

gtgaccacag gacaatgtac atgtggtcag 
cgtttgacct tggccaggtg acctcccctc 
65 acagggttag actagttgat ctgcaattga 
ccggaatgtc ctggcctccc tgtLcLgcca 
gactttccca gccagccctg atacttccta 



ccccacccct tgacccacct tttgtccatg 2845 

cagggtttaa aatggccctc ccaggctggg 2905 

tttgggagac tgaggtgggc ggatcacgag 296b 

tggtgaaacc ctgtctctac taaaaataca 3025 

gtgggcqcct atagtcccag ctactccgga 3085 

gaggcggagc ttgtagtgag ctgagatcgc 3145 

agactccgtc tcaaaaaaac aaaaaaaaaa 3205 

aaggtcccag tcttgccatg tgatgccttc 3265 

gaggagacac aatcaagggc accaggaatt 3325 

tttggaaaca attaaacaac aatgcaaagc 3385 

tctaactgta aggacaattg gaagtctqtg 344b 

catacctttc cctccagaac actgtgctct 3505 

attgtagaqt gaggoaatct ggactcccct 3565 

aatcaatgaa cctcgctggg cctcagtttc 3625 

atgatctaca aatttctctg gccctgaggc 3685 

cctagagaag gaggagatct gaat-ccttga 3745 

aaccatagat tttttttttt taagtgagaa 3805 

cagnagtccg gcagaggagt ggcttttgca 3865 

ttaccagaga agcaacactc aaaaqtcact 3925 

cagaactgac cagaatgaga ggacaggaca 3985 

aggtgtggat gaacccaggg gcgattgcaa 4045 

tgaaacactc agggcctggt gctctctgaa 4105 
gcccagggct ggaccagtgc ttgggaggcc 4165 
tcccaggact tgccttgatc tagatttctc 4225 
ggtgtcaggg gttgggaggg gggcagtgtg 4285 
aacaggcagq aaaagctLct agagactcga 4345 
gaggggcgtc tatggcagac gggccactqa 4405 
Lggatcttat gggtaaatga caggcttagg 4465 
agagcctgtc atqactatgt agggggatgt 4525 
gaaaccctcc ccaggattcc tctttttact 4585 
caaatgagat cttgatgtcc cgactctgat 4645 
cctgggtctc aattttgacc tcatcaagat 4705 
atgacctgaa attgattlgc tcccagcttc 4765 
aataaatgca gggaccaact gatgaggcLc 4825 
gtagaggtct tgcatcagac cccacgaggc 4885 
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ftD3^TO7fi .07QS02 



15 



25 



caggagcaca gcaagggtcg tggaggggct tagagctacc ccacCtctag ctgatgaaac 4945 

cctgccgtgc caccccactc ctcctacctt tgcacagctc cfctcctctct ggccactttc 5005 

5 

aaaccacgcc tggcataact cagtaatgcc ttctgcattc gctcagctga tgccagatgc 5065 

cfcagtgtata cttggcgctg ttttgggLgc tgtaagaagt caccatgtga tggtgtccct 5125 

10 gctctcatgc atgtgtggag tcaccctggt gtagctctat ggataggggg tgctcactga 5185 

gagagtcagg ggagatattc aaacagcaca agcataactg gtccccaaga taggaggaac 524 5 

agagagaccc ctgaaatcta aacagcctta actggtgcct ctgacaaggg ctggccggcc 5305 

agcgtcacag gacagctatg tgacgcttgt cccaaaggcc acactttgcc tttccagatt 5365 

tgcagatgcc attatctcag gcctgtgaca tgctgggctg tctccctctg attctaagtg 5425 

20 aacacccctc tgagggggac atctttcctt ccaggcagat ttcat.ttgca ccttgtacca 5485 

ttgcccttgt gtgcggcaca ctgccagacg tccctgaact acttctttgc tcacatctgc 554 5 

ctgcoeacca ctgcccatcL gggcgtctca ttctcccctt cccagacccg caatgccttg S605 

cccactttcc aagcaccagt cccggctgct ggccctccat cacatcctct agaacaaccc 5665 

cacaaggctg atccgctgag ccaggcttcc cccaacctgc tgttggcatc tctagtctct 5725 

30 atcgtcttct ccactcacac agcatttgtt gtgcctcccc agacatctgc tttgatctct 5785 

tttgtatttc taaatcactt gccactcgtg gcggcccccc ttccaccatg agactatcag 5845 

ttccttgaga gaagcgctca cctctgcctt ctttcccttc tcccacctgt ccctgtgccc 5905 

agcccactgt tggctttcag ttaatgagtq tgtctgtgtg tgagacacac ccacatafcgc 5965 

acatccacaa gccattcaca cacatggatc tgcacgttct cacaccccca catttgcaaa 6025 

40 cacacgtcag cttgcctcag cacctgagaa ggcagccacc tctgtgagta ccatgcccac 6085 

aagtgacagt ctctgcattg atcgacgtcc tccctgtccc aalcccaqgc aggcagggct 6145 

gcctctccac cagcctctcc accagccacc ctgcccctcc ctggccttcc ccttgccacc 6205 

catgtcaaqq qggcaggaca gtggccatca ccccbacccc aqqgcaaacg aggaaagcaa 6265 

agcctgaaac agtgagtggc atgacttacc aagggcgtcc acagttaaca gccatttgac 6325 

50 tttcttctga cLLccaggga agggctgttt gcacctgcca ccgccttgcc tacatccccg 6385 

ctctcctgac ccagacttct gggccccagt ggggatacag acagagoaca gttgcatcct 6445 

gctgtcccag gcctccttgg ccagcagcag ctgcccogaa tggtccacgc acacaccccc 6505 

tccctggctg tggctgagcc tcagaatcct ggggagctgc ggLclylycc acaccacacc 6565 

tcggtcacct cacccagagg tgtcacaagc cagcctcaca ctcctcttgc aagcgagcag 6625 

60 gggaaalccc ctctctgccc cacgagagga catgccacag cttgctccag aagccactcc 6685 

aaggccgagc gcccaggatg gtqtcaacgt gctggaoact aacgggcaag agaggcttgg 6745 

gtcaccacct ccagggcagc ctcagggccc tcagagggga aacttgctca ctgcaaaatg 6805 

ggagtcctgg gccaggcagc cataagccgc LcccLagcac cacaggcacc caaattcggc C865 

agctcagagg tggaaccacc cacccacagg ggtcagagag gagagacggg cagggacttq 6925 



35 



45 



55 



65 



114 



ccacataggg ggttgaagto ttgcttgtgg 
agaaagaagg ggaatcatgg ggccaagggc 
5 agatgttgct gcagagggat gaggctggga 
gcagcacatg gtgtcaggaa gcatggaggg 
tggagaccag cccccgccct cagactgccc 

10 

cacactcagc ccaccctcgt gacctcccat 
cccagggccc agcccctgat cactgtgtgg 
15 gggccccacc cacctggccg agcgggtaga 
ccatcccggg caacactcag acaggcaccc 
ccccaggcgg acctcctcaa gaagcaagtc 

20 

caatgttgaa cacatcgtga agaaaccaca 
ccagccaatt gttgccacta ccaaatattg 
25 ttgaagtctc ccatcctcct gatccctttc 
ggaggcctag qggatgcact ggctggataa 
tgggtcctct ctccagagga gacgcaggtc 

30 

tcccctccct ctcactcctg ccaaagtcaa 
gcccgcagta tcagccactc agcaaatctc 
35 gggcttttca ggtcctgagt ggaagtctct 
gtctcaqqcc tcccgcctac caccgccggg 
tgcttctatc aatgtggccg gtcccgtgac 

40 

ctcLctgtgg ggcggttatt aggctggagt 
catcccccaa caagaagcag tggtgccctt 
45 agagggtgat caccttgggt agatacogat 
ccggagccag aaccagtggc ccctgtcctt 
gcagctccaa cctcaaggtc tcagcaaccc 

50 

ggcccatggg agaatggagc tggggtgggg 
cttcccaggc taagagggaa gtacatcagg 
55 gaccaaggac cagtcaggtg ggaaacqtcc 
ctgtcctcgg ctgttgggaa gtcttaagag 
cgtaatccca gcactttggg aggcccaggc 

60 

Lcagcctgga caacataggg agaccctgtc 
gtgtqgtagc acacaactgt ggttccagcc 
65 qagcctggga ggtcaaggct gtaatgayacj 
oorggLcctg ccgaagggaa tccaggggtg 
gacccagact acaaagttct actgrtggoc 



catctgaagg catgcaggcc tcggtgggtg 6985 

agcagagccc cacgcagctt caagagtagg 7045 

ggaaatccac cagcctagta gcagaacaca 7105 

aagaggcctg agaggaaagg gggccctgcc 7165 

cccaccagcg cacacacaga cagacacaaa 1 7225 

ggtggttcac ctggcccagc cccagcatcc 7285 

gcttgtggca ygaggcgctc ttcctccaaa 7345 

actgggggcc tggcagccac ccagctgggg 7405 

aggctgggaa ctgcctgaaa tctccatgag 7465 

ccagtccagc cgaacatcag ccatcccaag 7525 

cagataagcc cccagattct accccaaact /58b 

gaaLttatca ctggaagcta aactggcagc 7645 

ctgccccaac cccagctgca gaagggagga 7705 

ttagaaccag cttccactct tcagccctcc 7765 

acccctcacc accccgaaac tctgctggca 7825 

agtgcccccc actacccqaa agcctctgga 7885 

ctcccggagt aattggatgt gatggaacga 7945 

ggatgtcnt-fc taatggcaca aaaaataaaa 8005 

acatcattaa tctcggcaga aaggtatcag 8065 

gagccagaca aattacagac ctgagccggg 8125 

tgagtttttg ttttttctct tcagggccac 8185 

acctaagagg agaggcagcg aggccaggga 824 5 

caccacclga aggaacaggg ctgttcggag 8305 

acagagaccg ccctgggaca agaggctgcc 8365 

actccttgtt ctccagccca cctgaacggg 8425 

ggagcctccc tcccLcagct tcctacccca 8485 

atggaaatac atcaggacgg aaatactgag 8545 

cttggctaca gcagggtcac ccaggaactg 8605 

tgggcctgtc taqgtgcagt gactcacacc 8665 

aggaggatca cttgggccag gagtttgaga 8725 

tctacagata atLLtttttt aattacccag 8705 

actcaggagg ctgaggcaag aqgatcacct 8845 

gtgatagtgc cactgcaccc cagcctggag 8905 

tcactagggg tggggacctt ytcLgclagg 8965 

aaggccaccc tttgaaggac cagactgtga 9025 

U5 



25 



ttaggttagt attacttgta ttatattaga aactcgccaa gggcaggggg taaaataagg 9085 

cagctccgct gaccccatgc ctggcagagg accagggtga tggctgccag tqttaaaggg 9145 

5 

gccctggggt ctgacctcat ctaagagctt gcacttctgg caggggagcc gatcccccat 9205 

tctaccaagt cctgcaaggg ggccagggta agctggatgt ggcagagcac ctctcagtct 9265 

10 ctcagatgcc tgcttagccc taactggccc aatggagccg ticcagagttt gtccagcacg 9325 

tgacaaacca ctcccaaaca gtggaaagcc aaccccacag atttgggaag cttggggtgg 9385 

agagagtgac tgtcaagacc tgccccaggg ccttttccaa acattcctgc cat-catcaca 9445 

15 

gtccctgqga ctaaagctcc gaggaggagc cccttcccca gctgctgaag acaggggaca 9505 

tctttccccc ctagagccta ataggaggcc aggcacagag tggtggccca gaaatccccc 9565 

20 agtggcttta gataaggtta tcaggctctc cccttggccc ccatctqgaa catcgctgaa 9625 

atctaacatc tctcatcact aacatctttc ctatcagcac tgaccagtaa gtgagagtgc 9685 

eaggactgct gcatctggct atgcaggatg agaacagcac cccctacggt LgLgcaggga 9745 

tcaacctgta catcatatgt ggcagccctg tcctgagatg cccaggcctc tgtgtcccag 9805 

ccacagggca ggtggctcat gagacaggca tcaqacagct tcctcctcct ttctcccttc 9865 

30 agccaaggcc tcccaccacc aaggcctctg aagtgcccac actgccccta ccctcgtccc 9925 

aggaagagcc tggcctaggt gcatccagga tacagtggag tggcccaagc acagacagca 9985 

gcgctgccct tttcctgcct ggccgcggac tctctgcclc tcggggacag cagtgccata 10045 

caggaagggg ctttcctctg agcggtgacc acagaagccg cagcctccta qctctccagc 10105 

tctccccaca caccccaagg gcctgact ca ccccctctgc cttctagaac ttgccctctg 10165 

40 tggcaggaaa cagggggtgt aaggagttgc ggccaagcca ggaaaggagg cagcaagagg 10225 

ctagccggca cggggactgg gtttcaggag ctggccaggg agcccgagaa ccacagggct 10285 

gtcgacttgg gactggaaag gaaccacgag tgccccaggg cactgaaggc tcagaggctt 10345 

ccgaacLatt gaggggagcc agctctcgct ctggactcaq gggatttctt ccccgcagct 10405 

tgccccagcc tcgtgatggg tttcaattcc cttcctgccg tgcaggttcc ttcctgccct 104 65 

50 acggtgggga aggccccctc gtccctgttc ctgccacaca ccaattcacc cataagcccg 10525 

tattaccctg gcatgtgcgc gtgcgcacac acacgtgcac ccacacatac acatgtgtgt 10585 

gcacacactc gttcccaggg tcataatacc ataattcctt tccttccagc cagctgggct 10645 

cclagggtgc agaggctcag gcacagcccc tctgaaagga cactcaggLg acactgagcc 10705 

tcctaggaga acccgqctgc agLcagtaca cctccaccgg gagaggcttt ctcagcacct 10765 

60 tggccaccct ttctgcaaaa gaactcctgg aaccccLLgt tttccagccc ccctgggccc 10825 

catclqagca tgcagtcact gagtcgaata ggatttgagg gaatctgcaa aggc:ct"ttcc 10885 

ccatccctgc cactacgaag cttccttgcc agtaccccag cttgtgcctc agtcatccct 10945 

tctglgaact gggaatggga acagtcctcc ctttttggga ttgglgtgag gattaagcga 11005 

attatgtgtc aagctcttac agtagtacca acacataggt ggggtgttac tgctallatt 11065 
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65 



116 



10 



20 



30 



gtttgcatta tLgttattgt tattttgggt ttcagattaa aaattatcct ctccttgaga 11125 

acatctttga aatgtcagca gcctctggga ggcagcaacg cttcaatggg cttgctatct 11185 

taattgctat gaaaagggtg gtgggtgctg accgaggaga caggcaggtg ggagggtggg 11245 

tggcccgtga gtctcaagtg gggttctgag agcLlccaac cccttgccct cttctgtctg 11305 

ggtatcagag gtgactggtc tccattaatc cactgtgtat cgaccactgc ctagctctgc 11365 

tccggacagg atcttcccca ggcactctct ctccaccctg ggaggttgct gcatttagca 11425 

aaacaacaac aacaacaaaa agtaggatgc ccagtaaaat ttgaattcca gataaacaac 114 85 

15 ggaaaaaaaa attcatgtaa ggatgctcca tgcagaatgt gggacacgct cacaccaaca 11545 

agcgatttgc tatttatcLg agattcaaat taaactactt cctgcctttt cctggcaagc 11605 

atctgcccag ggtgcctgac atccagggtt tgggaaacao goncgtggga atatgcctct 11665 

gaccactt-cc ctcagcctca ctccatgccc caggctgtgc ccttcttgcc ctgccacaag 11725 

gcctgttcac acttgcccca gaagatgcgc ctccctgtac acacttgtgt tttctccccc 11785 

25 tgaggaggaa cccctccgac ctgggcaccg tctgggtttc tgcaggccca gctcccccgc 11845 

cagggtccag qggctgcccc tccacctctg ccatggggaa ggcacactgg tgcaccccct 11905 

gtcctgggac agagacgagg aggtgtttgg ggtttagggg agaagattga aagaagagca 11965 

cagatacttc caccaatctc agccccagca gcaggggtga gcaggtgtga agqaaagtag 12025 

tgggtgaagc ctcctgtcct ggggtcaggg gagcctgcag ggagagggca ttagtagagg 12085 

35 aagtgggcag cccgcatgac tgggcttcct ttctgcccgg gagcgggagc gagaaggcag 1214 5 

ttttqctaaa cagaggtgct ggggaaggag attcagacat tctcaactgg ggctgggctc 12205 

tcagtcccca ttgccctctg gggagcaggc agctctgctg ctgttcagag gctctgctaa 12265 

catclggggg ctacctctgc aacaggagcc ggaagctccc ccaqccacta cgocccatgg 12325 

ccatcgcatc acctgctgcc tcggctccca cgagccggcc tccctctcag gcccctgcca 12385 

45 gctttcaagt ccatggctct ttctcactgc ctccetglgg tctgtttgtc cttccaggc 12444 

tct tgg aaa atg gac aga ttt ctg aac cgt ttc cac ctg ggc qaa cct 12492 
Ser Trp Lys Met Asp Arg Phe Leu Asn Arg Phe His Leu Gly Glu Pro 
40 45 50 

50 

gaa gca age ace cag ttc atg ace cag aac tac cag gac tea cca acc 12540 
Glu Ala Ser Thr Gin Phe Met Thr Gin Asn Tyr Gin Asp Ser Pro Thr 
55 60 65 

55 etc cag get ccc aga gaa agg gcc age gag ccc aag cac aaa aac cag 12588 
Leu Gin Ala Pro Arg Glu Arg Ala Scr Glu Pro Lys Has Lys Asn Gin 
70 75 80 85 

gcg gag ata qct gcc tgc tag atagcegget ttgccatccg ggcatgtggc 12639 
60 Ala Glu lie Ala Ala Cys 

90 

cacactgccc acraccgacg atgtgggtat ggaaccccct ctggatacag aaccccttcL 12699 

65 tttccaaata aaaaaaaaat catccagggc ttggtgetLt gtatcctagc ttgtattcct 12759 

Ltgaaaagca acaggaggaa attggccaat atccagcact ggggactgag gctttcctgt 12819 

■t 

aaatgcaagq ggtLqggtgg ggagtggggg gaatagaegg acaggcagga tatagtgatc 12879 

117 



40 



is 



25 



35 



cctatgcctg aactgtcact gagtcgctgt gagacctcac ggaaacctct taccctcccc 12939 

aaagttctgc tcccttggtt gtgacacgga ggtaggaaga agtgggagag ggtggttggt 12999 

5 

ttatcaccac gggcctttcc agctctgaca ttcgacaacg ttcattcagc gactattgtt 13059 

gcatgtgggg tgcacctggc cttcctccag gcactggaga tgcatgcctg cttcatcccc 13119 

10 atgcatgggt agatagggca gacacagcat ggtgctcacc tcatggagct tccagtccaq 13179 

caggggaagc aaatatcccc aacaaatgat aaactgtgat aagtatgatg agggtgcaaa 13239 

gggggcctga tttctgacgg agagaaaggg ttgaagactc aggcagagcc ctgatgttta 13299 

acttcagggc tgaaaagtgg gctaggggtt accaggtgag tggagacagg aatagagaac 13359 

attccaaagc tcctccgagc ccctgtggct ggcaggggca tgcatggcat cttccaggaa 13419 

20 atgaaaggcc catgctqtqg ctgccgcaaa gtatgaaagg gcaagaacag tatgacgggt 13479 

gggcaaaggc cagaccatgc caggcctttg ttgccatgtt aaagctctgg attgtatcca 13539 

gaaagcagLg ggaaatcatt aaatgataag tagattgggc caggcatggt ggctcgtgcc 13599 

tataatccca gtgctttggg aggccaaggc cagaggatcg cttqaaqcca agagttcaag 13659 

accagcctgg gcagcatagg gagaccccca tccctacaaa aataagaata aaaaattagc 13719 

30 caggtatggt ggtggatgcc tgtagtccca cctactcagg aggctggcat aggagaatca 13779 

cctgggccca ggagtttgag actgcagtga tctatgaatc tgtgattgca ccactgccct 13839 

ccagcctggg caacccagca agacctcatc tctaaaaaat tcataaaata aaataaaata 138 99 

aaaataqatt qqagaggcag atggcat-aar cagatgtatg cttttagaaa gctgtgaata 13959 

acgccaggtg cggtggctca cacctataat cccagcactt tgggaagcca aggtgggcaa 14019 

40 atcaccLgag gtcaggagtt cgagatcagc ctgactaacg tgaaaccctg tctgtactaa 14079 

aaatacaaaa attagcccgg cgtaatqgtq qqcgccfcgta atcccagcta cttgggaggc 14139 

tgaagcagga gaatcgcttg aacccgggag gcagaggttg cagtgagccg agattgcacc 14199 

octgcactcc ogrctgggcg acaaagtgag actctgtctc aaaaaaaaaa aaaagaaaga 14259 

aagaaaaaaa gaaaagaaag cggtgaataa attcttaggt atctgcaagg aaaggacttt 14319 

50 tgccatctaa cttcc 14334 

<210> 10 

<211> 131 

<212> DNA 

55 <213> Homo Sapiens 

<220> 

<221> raisc-f eature 

<222> (1).,(131) 

60 <223> exon 1 

<220> 

<221> CDS 
<222> (1)..(129) 

<400> 10 

atg ggt tgg aca atg agg ctg gtc aca gca gca ctg tta ctg ggt etc 48 

Met Gly Trp Thr Met Arg Leu Val Thr Aln Ala Leu Leu Leu Gly Leu 
1 5 10 15 
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gag gcc etc ttg gac gag gac acc etc 
Glu Ala Leu L«a Asp 2x2 Asp ?hr Leu p£e Cvs " 
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ccg ata gtc aag ttc ccg ggg gcc qtq aac 
Pro lie Val Lys Phe Pro Gly Ala Sal As£ 
35 40 



30 



127 



40 



60 
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99 
DNA 

Homo Sapiens 



fltxsc_feature 
(1) - . (99) 
oxon 3 



<220> 

<221> CDS 
50 <222> (1)..(99) 



<400> 12 

s5 is a a a a a a a a a a a s a » a 

10 15 



s a a a a a ss a s s a a a a a a 

25 30 
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Lys 
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65 <210> 13 

<211> 46 

<212> DNA 

<213> Homo Sapiens 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<2?3> 



roisc__feature 
(1).. (46) 
exon 4 



misc_feature 
(16) (16) 
predicted SNP site 

a is replaced by g in seqID 17 <176. 
and in seqID 18 (115- .115) 



.176) 



<220> 

<221> CDS 
15 <222> (1)..(45) 

<400> 13 

ggc gcc gac ctg aag aaa ggg aag att cag ggc cag gag tta tea g 
Gly Ala Asp Leu Lys Lys Gly Lys lie Gin Gly Gin Glu Leu Ser 
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<210> 


14 


<211> 


114 


<212> 


DNA 


<213> 


Homo 


<220> 




<22i> 


misc 


<222> 


(1)." 


<223> 


exon 


<220> 




<221> 


CDS 


<222> 


(3). 


<400> 


14 


cc tac 


cag ( 



(113) 



Tyr Gin Ala Pro Ser Pro Pro Ala His Ser Gly Phe Bxs Arg Tyr 
15 10 15 

cag ttc ttfc gtc tat ctt cag gaa gga aaa gtc ate tct etc ctt ccc 
Gin Phe Phe Val Tyr Leu Gin Glu Gly Lys Val lie Ser Leu Leu Pro 
20 25 30 

aag gaa aac aaa act cga g 
Lys Glu Asn Lys Thr Arg 
35 



47 



95 



114 



<210> 15 

50 <211> 269 

<212> DNA 

<213> Homo Sapiens 



55 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(1).. (269) 

exon 6 for seqID 3 and seqID 4 



<220> 
60 <221> CDS 

<222> (3).. (15b) 

<400> 15 

gc tct tgg aaa acg gac aga ttt ctg aac cgc ttc cac ctg ggc gaa 
65 Ser Trp Lys Met Asp Arg Phe Leu Asn Arg Phe His Leu Gly Glu 

1 5 10 15 

cct gaa gca age acr cag ttc atg acc cag aac tac cag gac tea cca 
Pro Glu Ala Ser Thr Gin Phe Met Thr Gin Asn Tyr Gin Asp Ser Pro 
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55 
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acc etc cag get ccc aga gga agg gec age gag ccc aag cac aaa ace 
Thr Leu Gin Ala Pro Arg Gly Arg Ala Ser Glu Pro Lys His Lys Thr 
35 ^ 40 45 

agg egg aga tag ctgcctgcta gatagcegge tttgccatcc gggcatgtgg 
Arg Arg Arg 
50 

ccacactgct caccaccgac gatgtgggta tggaaccccc tctggataca gaaccccttc 
ttttccaaat taaa 



<210> 16 

<211> 269 

<212> DNA 

<213> Homo Sapiens 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 



misc feature 
(1) .7(269) 

exon 6 for seqID 1 and seqXO 2 



CDS 

(3) (164) 



<400> 16 

gc tct tgg aaa atg gac aga ttt ctg aac cgt ttc cac ctg ggc gaa 
Ser Trp Lys Met Asp Arg Phe Leu Asn Arg Phe Has Leu Gly Glu 
15 10 15 

cct gaa gca age acc cag ttc atg acc cag aac tac cag gac tea cca 
Pro Glu Ala Ser Thr Gin Phe Met Thr Gin Asn Tyr Gin Asp Ser Pro 
20 2b 30 

acc etc cag get ccc aga gaa agg gee age gag ccc aag cac aaa aac 
Thr Leu Gin Ala Pro Arg Glu Arg Ala Ser Glu Pro Lys His Lys Asn 
35 40 45 

cag gcg gag ata get gee tgc tagatagecg getttgecat ccgggcatgt 
Gin Ala Glu He Ala Ala Cys 
50 

ggccacaetq cccaccaccg acgatgtggg tatggaaccc cctctggata cagaacccct 
tcttttccaa ataaa 



<210> 17 

<211> 624 

<212> DNA 

<213> Homo Sapiens 
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269 
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<220> 
<221> 
<222> 
<223> 

<220> 
<22l> 
<222> 
<223> 



wise feature 
(1) .7(624) 

public sequence DM984582 from dbEST (1..624 c) 



COS 

(2) .(487) 

seqID 1 (67. .227) 

seqID 2 (45. .205) 



<400> 17 

c tac aga cag aag ate acc tec tgg atg qaq ccg ata gtc aag ttc ccg 



49 



121 



£*Q3<SW*--S7*i& .0708DS 



10 



15 



Tyr Arg Gin Lys lie Thr Ser Trp Met Glu Pro He Val Lys Phe Pro 

15 10 15 

ggg gcc gtg gac ggc gca acc tat ate ctg gtg atg gtg gat cca gat 

Gly Ala Val Asp Gly Ala Thr Tyr He Leu Val Met Val Asp Pro Asp 
20 25 30 

gcc cct age aga gca gaa ccc aga cag aga ttc tgg aga cat tgg ctg 

Ala Pro Ser Arg Ala Glu Pro Arg Gin Arg Phe Trp Arg His Trp Leu 
35 40 45 

gta aca gat ate aag ggc gcc gac ctg aag gaa ggg aag att cag ggc 

Val Thr Asp lie Lys Gly Ala Asp Leu Lys Glu Gly Lys He Gin Gly 
50 55 60 

cag gag tta tea gcc tac cag get ccc tec cca ccg gca cac agt ggc 

Gin Glu Leu Ser Ala Tyr Gin Ala Pro Ser Pro Pro Ala His Ser Gly 

65 70 75 80 



97 



145 



193 



241 



20 ttc cat cgc tac cag ttc ttt gtc tat ctt cag gaa qga aaa gtc ate 
Phe His Arg Tyr Gin Phe Phe Val Tyr Leu Gin Glu Gly Lys Val He 
85 90 95 

tct etc ctt ccc aag gaa aac aaa act cga ggc tct tgg aaa atg gac 
25 Ser Lou Leu Pro Lys Glu Asn Lys Thr Arg Gly Ser Trp Lys Met Asp 
100 105 110 

aga ttt ttg aac cgt ttc cac ctg ggc gaa cct gaa gca age acc cag 
Arg Phe Leu Asn Arg Phe His Leu Gly Glu Pro Glu Ala Ser Thr Gin 
30 115 120 125 

ttc atg acc cag aac tac cag gac tea cca acc etc cag get ccc aga 
Phe Met Thr Gin Asn Tyr Gin Asp Ser Pro Thr Leu Gin Ala Pro Arg 
130 135 140 

gaa agg gcc age gag ccc aag cac aaa aac cag qcq gag ata get gcc 
Glu Arg Ala Ser Glu Pro Lys His Lys Asn Gin Ala Glu He Ala Ala 
145 150 155 160 

40 tgc tag atagcegget ttgccatccg ggcatgtggc cacactgccc accaccgacg 
Cys 



35 



289 



337 



385 



433 



481 



537 



45 



atgtgggtat ggaaccccct ctggatacag aaccccttct tttccaaata aaaaaaaaat 597 
catccaggga aaaaaaaaaa aaaaaaa 624 



50 



<210> 18 

<211> 547 

<212> DNA 

<?13> Homo Sapiens 



55 



<220> 
<221> 
<222> 
<?23> 



misc feature 
(1) .7(547) 

public sequence AI857902 from dbEST (1. .547 c) 



60 



<220> 

<221> COS 
<222> 
<223> 



(1)..<414) 
seqID 3 (87. ,223) 
scqTD 4 (65.. 223) 



65 



<400> 18 

ggc gca acc tat ate ttg gtg atg gLg qat cca gat gcc cct age aga 

Gly Ala Thr Tyr He Leu Val Met Val Asp Pro Asp Ala Pro Ser Arg 
15 10 15 



48 



122 



gca gaa ccc aga cag aga ttc tgg aga cat tgg ctg gta aca gat ate 96 
Ala Glu Pro Arg Gin Arg Phe Trp Arg Hxs Trp Leu Val Thr Asp He 
20 25 30 

5 aag ggc gec gac ctg aag gaa ggg aag att cag ggc cag gag tta tea 144 
Lys Gly Ala Asp Leu Lys Glu Gly Lys He Gin Gly Gin Glu Leu Ser 
35 40 45 

gec tac cag get ccc tec cca ccg gca cac agt ggc ttc cat cgc tac 192 
10 Ala Tyr Gin Ala Pro Ser Pro Pro Ala His Ser Gly Phe His Arg Tyr 
50 55 60 

cag ttc ttt gtc tat ctt cag gaa gga aaa gtc ate tct etc ctt ccc 240 
Gin Phe Phe Val Tyr Leu Gin Glu Gly Lys Val He Ser Leu Leu Pro 
15 65 70 75 80 

aag gaa aac aaa act cga ggc tct tgg aaa atg gac aga ttt ctg aac 200 

Lys Glu Asn Lys Thr Arg Gly Ser Trp Lys Met Asp Arg Phe Leu Asn 
85 90 95 

20 

cgc ttc cac ctg ggc gaa cct gaa gca age acc cag ttc atg ace cag 336 

Arg Phe His Leu Gly Glu Pro Glu Ala Ser Thr Gin Phe Met Thr Gin 
100 105 HO 

25 aac tac cag gac tea cca acc etc cag get ccc aga gga agg gec age 384 
Asn Tyr Gin Asp Ser Pro Thr Leu Gin Ala Pro Arg Gly Arg Ala Ser 
115 120 125 

gag ccc aag cac aaa acc agg egg aga tag ctgcctgcta gatagcegge 434 
30 Glu Pro Lys Has Lys Thr Arg Arg Arg 
130 135 

tttgccatcc gggcatgtgg ccacactgct caccaccgac gatgtgggta tggaaccccc 4 94 

35 tttggataca gaaccccttc ttttccaaat taaaaaaaaa aatcatccag gaa 547 
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